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In  this  dissertation  an  indirect  adaptive  control  algorithm  is  presented,  which 
can  be  applied  to  linear  time-invariant  systems  as  well  as  to  linear  time-varying 
systems  with  rapidly  varying  parameters.  This  algorithm  does  not  require  persistent 
excitation.  A key  feature  of  this  algorithm  is  the  use  of  an  assumed  structure  of 
parameter  variations  which  enables  it  to  cover  a wide  class  of  linear  systems. 

A modification  of  the  least  squares  algorithm  is  used  for  the  parameter  estima- 
tion, which  has  a dead  zone  to  handle  robustness  to  the  unmodelled  plant  uncertain- 
ties or  external  disturbances.  A forgetting  factor  is  introduced  in  the  algorithm  to 
aid  in  the  tracking  of  time- varying  parameters.  The  forgetting  factor  used  is  based 
on  the  constant  information  principle.  The  control  law  used  is  due  to  Kleinman. 
The  advantage  of  this  method  is  that  it  can  be  applied  to  linear  time- varying  plants 
with  reachability  index  greater  than  the  system  order  n as  well  as  to  the  plants  with 
reachability  index  equal  to  n. 
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With  this  control  law  and  a state  observer,  all  based  on  the  parameter  esti- 
mates, it  is  shown  that  the  resulting  closed-loop  system  is  robustly  globally  stable. 
As  a design  example,  the  proposed  algorithm  is  applied  to  the  design  of  the  nor- 
mal acceleration  controller  of  missiles  with  rapidly  changing  parameters.  Excellent 
performance  is  demonstrated  by  the  proposed  algorithm. 
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CHAPTER  ONE 
INTRODUCTION 


Although  the  main  applications  of  adaptive  control  are  in  situations  where 
plant  characteristics  or  parameters  are  changing,  the  strongest  theoretical  results 
have  been  made  for  linear  time-invariant  plants.  Most  research  in  the  adaptive 
control  of  time- varying  systems  assumes  that  the  system  is  slowly  time- varying  [1,2]. 
Only  a few  papers  consider  time-varying  systems  which  are  not  necessarily  slowly 
time-varying.  First  results  in  this  direction  were  reported  by  Xianya  and  Evans 
[3].  They  modelled  the  unknown  parameter  variations  as  linear  curves.  Later, 
Zheng  [4]  replaced  linear  curves  by  the  truncated  Taylor  series.  This  motivated 
the  concept  of  the  structured  parameter  variations  which  play  an  important  role  in 
this  dissertation.  But  because  Xianya  and  Evans,  and  Zheng  chose  one-step-ahead 
controller  structure,  they  only  treated  a class  of  stably  invertible  systems.  Moreover, 
they  did  not  consider  the  robustness  problem  to  the  unmodelled  plant  uncertainties 
or  external  disturbances. 

More  recently,  Tsakalis  and  Ioannou  [5]  proposed  an  indirect  adaptive  pole 
placement  scheme  for  continuous  linear  time- varying  plants  which  may  be  fast  time- 
varying.  They  used  the  same  concept  of  structured  parameter  variations  as  this 
dissertation.  However,  it  is  not  clear  how  to  make  the  estimated  model  reachable  [5]. 
If  the  estimated  model  is  not  reachable,  the  closed-loop  stability  is  not  guaranteed. 
On  the  other  hand,  the  pole  placement  scheme  [5]  requires  that  the  plant  to  be 
controlled  should  be  reachable  in  n steps  where  n is  the  plant  order.  This  assumption 
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will  be  weakened  to  the  iV-step  reachability  where  N is  a positive  integer  with  N > n 
in  this  dissertation. 

There  are  two  main  difficulties  that  any  indirect  adaptive  control  scheme  should 
overcome.  One  is  the  reachability  condition  imposed  on  the  estimated  model.  In 
order  to  construct  a control  law  for  the  estimated  model,  it  should  be  reachable,  or 
at  least  stabilizable  if  the  plant  is  linear  time-invariant.  The  other  is  the  robustness 
problem  which  has  been  intensively  investigated  since  the  work  of  Rohrs  et  al.  [6]. 
The  closed-loop  should  be  stable  in  the  presence  of  unmodelled  dynamics  or  external 
disturbances. 

The  attempts  which  have  been  made  for  the  reachability  condition  are  as  fol- 
lows. In  earlier  works  of  indirect  adaptive  control  for  linear  time-invariant  plants, 
Samson  and  his  coworker  [7,  8,  9]  assumed  that  the  estimated  model  is  uniformly 
reachable  as  a generic  property.  But  later,  to  ensure  the  reachability  of  the  esti- 
mated model,  a convex  set  of  parameters  whose  element  corresponds  to  a system 
satisfying  this  condition  was  introduced.  A projection  scheme  then  was  used  to  keep 
the  parameter  estimates  within  this  set  [10,  11],  or  a correction  term  was  added  in 
the  standard  estimation  algorithm  to  drive  them  to  this  set  gradually  [12,  13]. 

Another  approach  is  to  use  persistent  excitation.  Persistent  excitation  even- 
tually makes  the  estimates  converge  to  a certain  tuned  parameter  matrix  which  is 
assumed  to  satisfy  the  reachability  condition.  However,  persistent  excitation  needs 
an  external  signal  which  is  sometimes  undesirable  from  a control  point  of  view  [14]. 

Another  alternative  is  to  use  search  techniques  in  the  parameter  space  in  parallel 
with  estimation  algorithm  to  provide  proper  substitutes  (call  it  modified  estimates) 
which  ensures  the  reachability  of  the  estimated  model  whenever  the  estimates  violate 
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the  reachability  condition  [15,  16].  Lozano-Leal  and  Goodwin  [16]  suggested  an 
algorithm  to  find  the  modified  estimates  which  converge  for  the  time-invariant  plant. 
But  when  applied  to  time- varying  plants  modified  estimates  may  not  converge  even 
though  the  estimates  converge. 

For  the  robustness  of  indirect  or  direct  adaptive  control  schemes,  several  at- 
tempts have  been  made.  The  first  one  is  to  produce  persistent  excitation  which  will 
give  the  tuned  model  exponential  stability.  This,  in  turn,  will  make  the  plant  stable 
in  the  presence  of  bounded  disturbances.  However,  guaranteeing  the  persistent  ex- 
citation in  the  presence  of  unmodelled  plant  uncertainties  still  remains  unresolved 

[171- 

The  second  one  is  cr-modification  proposed  by  Ioannou  and  Kokotovic  [18]  which 
is  a direct  adaptive  control  law  with  an  extra  term  — cr9,  a > 0.  It  was  shown  that 
robustness  to  the  plant  uncertainties  can  be  obtained  using  cr-modification  for  time- 
invariant  plants  [19]. 

The  third  one  is  signal  normalization  by  which  unmodelled  plant  uncertain- 
ties can  be  viewed  as  bounded  disturbances.  Middleton  and  Goodwin  showed  that 
robustness  to  the  plant  uncertainties  can  be  obtained  by  properly  normalizing  the 
signals  which  are  used  in  the  estimation  algorithm,  but  it  does  not  guarantee  the 
boundedness  of  the  parameter  estimates  [14]. 

The  fourth  one  is  to  use  a dead  zone.  When  the  prediction  error  is  less  than 
a certain  threshold  the  estimates  may  be  in  the  “wrong  direction”  due  to  the  poor 
information  spoiled  with  disturbances  or  unmodelled  plant  uncertainties.  This  can 
even  make  the  estimates  diverge.  Thus,  by  using  a proper  dead  zone  estimation  can 
be  always  in  the  “right  direction,”  and  robustness  to  the  plant  uncertainties  can  be 
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obtained.  Sometimes,  some  of  these  ideas  are  incorporated.  For  example,  a dead 
zone  with  signal  normalization  was  used  by  Kreisselmeir  [17]. 

In  this  dissertation,  we  consider  an  indirect  adaptive  control  scheme  for  discrete 
linear  time-varying  plants  which  are  reachable  in  N steps  where  N is  greater  than 
or  equal  to  the  plant  order,  and  not  necessarily  slowly  time-varying  nor  stably 
invertible.  It  is  assumed  that  the  plant  is  described  by  a linear  vector  difference 
equation  and  the  time  variation  in  coefficients  is  modelled  as  a linear  combination  of 
known  bounded  functions  with  unknown  coefficients  which  are  constant  or  allowed 
to  be  slowly  time-varying.  The  unknown  coefficients  are  then  estimated  by  an 
estimation  scheme  based  on  the  least  squares  algorithm,  but  properly  modified  as 
follows: 

• A projection  method  is  used  in  order  to  ensure  the  reachability  of  the  estimated 
model  [1,  10,  11]. 

• For  the  robustness  to  the  unmodelled  plant  uncertainties  and  external  distur- 
bances, all  the  signals  used  in  the  estmation  algorithm  are  normalized  with 
respect  to  the  norm  of  the  regression  vector  and  a dead  zone  is  used.  Signal 
normalization  will  make  the  size  of  dead  zone  bounded. 

• A forgetting  factor  is  used,  ft  will  make  the  estimation  algorithm  track  slowly 
time-varying  parameters  and  thus  the  unknown  coefficients  to  be  estimated 
will  be  allowed  to  be  slowly  time- varying.  In  consequence,  we  can  expect  that 
the  unmodelled  plant  uncertainties  will  be  smaller  than  otherwise.  This  is 
preferable  because  we  are  allowed  to  take  the  smaller  size  of  dead  zone  in  the 
estimation  algorithm. 
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In  fact,  these  modifications  are  not  new.  They  appear  frequently  in  the  liter- 
ature, separately  or  partially  together  with  the  least  squares  algorithm.  However, 
we  combine  all  of  them  to  make  a form  which  is  useful  in  practice,  and  give  the 
complete  convergence  analysis. 

As  a control  law,  a generalized  Kleinman’s  method  [20]  is  taken.  It  allows  us 
to  have  a weaker  assumption  about  reachability,  i.e.,  the  assumption  that  the  plant 
is  reachable  in  N steps  with  N > n,  where  n is  the  plant  order.  With  this  control 
law  and  a state  observer,  all  based  on  the  estimated  parameter  values,  it  is  shown 
that  the  resulting  closed-loop  system  is  robustly  globally  stable  in  the  sense  that 
all  signals  are  bounded  if  the  reference  input  is  bounded,  in  the  presence  of  plant 
uncertainties  and  external  disturbances. 

As  a design  example,  the  proposed  adaptive  control  algorithm  is  applied  to 
the  design  of  the  normal  acceleration  controller  of  a typical  missile  whose  dynamic 
pressure  (thus  plant  parameters)  changes  very  rapidly.  Excellent  performance  is 
demonstrated  by  the  proposed  algorithm. 

This  dissertation  is  organized  as  follows.  In  Chapter  Two,  we  review  some  basic 
adaptive  control  schemes  and  briefly  discuss  two  important  problems  of  adaptive 
control,  robustness  and  time- varying  plants  with  rapidly  varying  parameters,  which 
lead  to  main  subjects  of  this  dissertation. 

In  Chapter  Three,  a mathemetical  model  of  the  plant  with  assumptions  and 
the  control  objective  of  the  proposed  algorithm  are  presented.  In  Chapter  Four,  an 
estimation  algorithm  for  the  plant  parameters,  which  is  based  on  the  least  squares 
algorithm  for  multi-output  systems  but  includes  a forgetting  factor,  dead  zone  and 
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a projection  operator,  will  be  introduced  and  some  important  properties  of  this 
algorithm  will  be  shown. 

Chapter  Five  deals  with  a state  feedback  control  law  for  the  estimated  model 
which  is  based  on  Kleinman’s  law  and  a state  observer  for  the  plant,  all  based  on 
the  estimated  parameters.  With  the  parameter  estimation  algorithm  described  in 
Chapter  Four,  it  is  shown  that  we  can  construct  a bounded  sequence  of  feedback 
gains  which  makes  the  estimated  model  exponentially  stable.  In  Chapter  Six,  it 
is  shown  that  the  control  law  constructed  in  Chapter  Five  with  a state  observer, 
all  based  on  the  parameter  estimates,  the  closed-loop  system  is  globally  stable  and 
robust  to  the  unmodelled  plant  uncertainties  and  the  external  disturbances. 

In  Chapter  Seven,  the  proposed  algorithm  is  implemented  in  the  normal  ac- 
celeration controller  for  a typical  air-to-air  missile  and  the  performance  is  demon- 
strated. Many  practical  problems  are  discussed.  Finally,  conclusions  are  given  in 
Chapter  Eight. 


CHAPTER  TWO 
BACKGROUND 


In  this  chapter,  we  first  review  some  basic  adaptive  control  schemes,  especially 
Samson  and  Fuchs’  work  [7]  which  is  the  basis  of  this  dissertation.  After  that,  we 
briefly  discuss  two  important  problems  of  adaptive  control,  robustness  and  time- 
varying  plants  which  are  not  slowly  time-varying,  which  lead  to  main  subjects  of 
this  dissertation.  Since  the  aim  of  the  review  is  to  grasp  the  basic  ideas  of  adaptive 
control  algorithms,  we  only  consider  the  algorithms  for  LTI  (Linear  Time-Invariant) 
systems  which  were  reported  before  the  robustness  problem  was  raised.  In  fact,  these 
algorithms  bave  been  modified  to  get  better  robustness,  but  these  modifications  are 
not  included  in  this  review. 

2.1  Review  of  Basic  Adaptive  Control  Schemes 

We  begin  with  a simple  scheme  called  Gain  Scheduling  which  is  very  old  but  is 
still  used,  especially  for  handling  parameter  variations  in  the  flight  control  systems. 
The  concept  is  based  on  “frozen  coefficients.”  A family  of  controllers  is  designed  as 
a function  of  unknown  constant  plant  parameters.  Implementation  requires  direct 
measurement  or  estimation  of  these  parameters  and  a subsequent  adjustment  of 
the  controller.  The  resulting  controller  is  actually  time- varying  because  either  the 
measured  plant  parameters  or  the  estimated  values  are  usually  not  constant. 

In  general,  a control  scheme  is  said  adaptive  if  it  consists  of  two  parts,  a pa- 
rameter estimator  which  estimates  plant  parameters  or  control  parameters  and  a 
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controller  which  is  adjusted  with  the  estimated  values  of  plant  parameters  or  con- 
trol parameters.  In  consequence,  there  are  two  feedback  loops  in  an  adaptive  control 
system.  The  inner  loop  is  an  ordinary  feedback  loop  with  a plant  and  a controller. 
The  outer  loop  is  an  adaptation  mechanism  which  adjusts  the  controller. 

In  gain  scheduling,  there  is  no  feedback  to  compensate  for  an  incorrect  schedule. 
Therefore,  according  to  the  above  definition,  it  is  not  included  in  the  category  of 
adaptive  control  algorithms. 

Even  though  gain  scheduling  has  been  used  for  some  time,  it  was  not  until  the 
late  1970s  that  the  stability  problem  was  rigorously  treated  in  the  context  of  slowly 
time- varying  systems  [21].  The  stability  for  time- varying  systems  which  are  not 
slowly  time- varying  is  not  guaranteed  with  gain  scheduling. 

Now,  we  turn  to  the  modern  adaptive  control  schemes.  In  general,  an  adaptive 
control  scheme  is  called  direct  if  control  parameters  are  directly  updated,  and  it 
is  called  indirect  if  plant  parameters  are  estimated  and  using  these  parameter  esti- 
mates control  parameters  are  obtained.  There  are  two  basic  problem  formulations 
in  modern  adaptive  control.  The  one  is  the  MRAC  (Model  Reference  Adaptive 
Control)  and  the  other  is  the  STC  (Self-Tuning  Control). 

In  the  MRAC,  a reference  model  is  properly  selected  and  the  control  parameters 
are  adjusted  such  that  the  error  between  the  model  output  and  the  plant  output 
becomes  small.  This  scheme  requires  that  the  plant  to  be  controlled  is  stably  in- 
vertible, because  this  approach  is  based  on  the  cancellation  of  zeros  of  the  plant.  If 
control  parameters  are  directly  updated,  it  is  called  a direct  MRAC,  otherwise  it 
is  called  an  indirect  MRAC.  The  research  in  this  direction  has  been  done  mainly 
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for  direct  MRACs  of  continuous  LTI  systems.  A typical  example  is  an  algorithm 
published  by  Narendra  and  Valavani  [22]. 

The  idea  of  STC  originated  from  the  minimum  variance  self-tuner  which  was 
developed  to  solve  the  stochastic  regulation  problem  for  unknown  discrete  time- 
invariant  systems  [23].  In  minimum  variance  self-tuners,  the  control  parameters  are 
estimated  by  a recursive  parameter  estimator  such  that  the  variance  of  the  output 
is  minimized.  However,  thereafter  many  researchers  have  widely  extended  this  con- 
cept, e.g.,  pole  placement  self-tuners  [24],  model  reference  self-tuners,  LQG  (Linear 
Quadratic  Gaussian)  self-tuners  [25]  and  so  on.  The  terminology  STC  is  used  as 
having  the  almost  same  meaning  as  adaptive  control.  Usually,  the  direct  approach 
is  called  an  implicit  STC  and  the  indirect  approach  is  called  an  explicit  STC.  In  the 
case  of  implicit  STC,  stability  is  guaranteed  if  the  plant  to  be  controlled  is  minimum- 
phase.  In  contrast  to  MRAS,  the  minimum-phase  condition  comes  from  a desire  to 
make  the  estimation  model  linear  in  the  parameters.  For  nonminimum-phase  plants, 
the  estimation  model  becomes  nonlinear  in  parameters  to  be  estimated,  which  makes 
the  parameter  estimation  very  difficult.  To  solve  this  problem,  nonlinear  estimation 
schemes  have  been  suggested  [26].  Elliot  proposed  a model  for  parameter  estima- 
tion which  is  linear  in  parameters  using  the  Bezout  identity  [27].  However,  it  has 
been  reported  by  Johnson,  Lawarence  and  Lyons  [28]  that  when  the  plant  order  is 
underestimated  Elliot’s  algorithm  can  fail  to  work. 

Kreisselmeir’s  indirect  adaptive  regulator  [29]  is  another  direction  of  research.  It 
consists  of  an  adaptive  state  observer  and  an  asymptotic  feedback  matrix  synthesis. 
Samson’s  indirect  approach  [8]  is  similar  to  Kreisselmeir’s  in  that  it  also  uses  an 
adaptive  state  observer,  but  the  control  law  is  constructed  in  a different  way.  In 
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Samson’s  approach,  feedback  gains  are  obtained  by  solving  one  step  of  a Riccati 
difference  equation  at  each  time.  This  approach  is  also  used  by  Samson  and  Fuchs 
[7]  and  Ossman  and  Kamen  [13].  Since  Samson  and  Fuchs’  work  [7]  is  the  basis  of 
our  adaptive  control  algorithm,  we  will  treat  more  details  in  Section  2.5. 

2.2  An  Example  of  Direct  MRAC 

Consider  the  following  SISO  (Single  Input  Single  Output)  plant 

= (l~dB{q~x  )u(k)  (2.1) 

where  q_1  is  the  unit  delay  operator,  u(k)  and  y(k ) denote  the  input  and  output, 
respectively,  and  A(q~x)  and  B(q~1)  are  given  by 


Mq-1) 

= 1 + o,\q  1 + a^q  2 + • • 

1 71 

1 ^71? 

(2.2) 

B(q-1) 

= + b\q  1 + &2<7  2 + • 

• • + bmq  m 

(2.3) 

6o7^0;  d > 1 ; m + d < n. 

We  need  the  following  assumptions: 

Al.  An  upper  bound  for  n is  known. 

A2.  d is  known. 

A3.  The  plant  is  minimum  phase,  i.e.,  B(z~1)  is  Hurwitz. 

A4.  bo  is  known. 

Remark  2.1.  For  simplicity,  we  impose  A4  as  [30].  However,  Assumption  A4 
can  be  weakened  if  the  sign  of  b0  and  an  upper  bound  for  the  magnitude  of  bo  is 
known.  See,  for  example,  Goodwin  and  Sin  [11]. 

Let  the  reference  model  be  given  by 


Am{q  l)y™(k)  = q dBm(q  1)r(k) 


(2.4) 
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where  r(k)  and  ym(k)  denote  the  reference  input  and  output,  respectively.  We 
further  assume  that 

deg(Am{z))  = deg(A(z)) 

where  deg(A(z))  denotes  a degree  of  a polynomial  A{z). 

Suppose  that  the  control  law  is  of  the  form 

R(q~l)u{k)  = T(q~1)r(k)  - S(q~1)y(k)  (2.5) 

where  the  coefficients  of  T(q~x)  and  S^q-1)  are  to  be  determined  so  that  the 

closed-loop  has  the  same  transfer  function  as  the  reference  model  (2.4).  By  equating 
the  closed-loop  transfer  function  to  that  of  the  reference  model,  we  have 

.r-'UrMBtr1)  r^-Or1) 

A(q-')R(q-i)  4 /U.(9-‘) 

which  gives  a solution: 

nr1)  = em(r')  (2.7) 

Ttfr'W?-1)  + rds(q-')  = Am(q~')  (2.8) 

with 

R(q-')  = B(r')n(q~')  (2.9) 

where 


Am(q  x)  = 1 + cxq  1 + b cnq  n 

Siq*1)  = s0  + si^-1  + b sn-i<Tn+1 

R{q  X)  = r0  + rxq  1 + • • • + rm+d-iq  m d+1 
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From  (2.8)  and  (2.9),  we  see  that  r0  = b0.  Multiplying  y{k ) on  both  sides  of  (2.8) 
and  using  (2.1)  gives 

q~d  R(q~x)u(k)  + q~dS(q~l)y(k)  = Am{q~l)y{k).  (2.10) 

Let  S{(k  — 1),  i = 0, ...,  re  — 1 and  r,(fc  — 1),  i — 1, ...,  m + d — 1 denote  the  estimated 
values  of  S{,i  = 0 ,...,re  — 1 and  r,-,z  - l,...,m  + d — 1 at  time  A:  — 1,  respectively. 
Then  the  error  equation 

e(k)  = Am(q~1)y(k)  - b0u(k  - d)  - cj>\k  - l)0(k  - 1)  (2.11) 

where 

4>'{k—  1)  = [ y(k  — d)  y(k  — n — d + 1) 

u(k  — d — 1)  • • • u(k  — m — 2d  + 1)  ] (2.12) 

O'(k-l)  = [s0(k  - l)---sn_1(k  - l)ri(k  - 1)  • • • rm+d_i(A:  - 1)].  (2.13) 

is  used  to  calculate  9(k)  recursively  (see  Section  2.4  for  estimation  algorithms).  From 
the  equation  (2.5),  the  control  input  based  on  the  current  estimate  6(k)  is  given  by 

i / m+d—1  n—  1 \ 

u(*0  = t(-  H ri(k)u(k-i)  + T(q~1)r(k)  - s,-(fc)y(A:  - t)  . (2.14) 

bo  \ i= 1 i~0  ) 

2.3  An  Example  of  Indirect  Pole  Placement  Adaptive  Control 
Consider  the  plant 

A(q~l)y{k)  = q~dB(q~1)u(k ) (2.15) 

where  u(k ) and  y(k)  denote  the  input  and  output,  respectively,  and  A{q~l)  and 
B(q~x)  are  polynomials  in  q~l  given  by 

A(q  x)  = 14 - d\q  1 0-2 q 2 4-  ■ • • 4"  o,nq  n (2.16) 


B(q  J)  — b\q  1 4-  &2<?  2 4-  • • • 4-  bmq  m. 


(2.17) 
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We  need  the  following  assumptions: 

Al.  r = max{n,ra}  is  known. 

A2.  A(q~l)  and  B(q~1)  are  coprime. 

Compare  these  assumptions  with  those  of  the  previous  example.  From  A2,  we  know 
that  there  exist  polynomials  E(q~1)  and  F(<?-1)  which  satisfy 

A(r')E(«-‘)  + B(q-')F(q-')  = A'(q-')  (2.18) 

where  A*(q~x)  is  a desired  characteristic  polynomial  of  order  2r  — 1.  The  pole 
placement  control  law  is  then  given  by 

F(g_1)u(fc)  = F(9_1)(r(A:)  - y(k))  (2.19) 

where  r(k ) is  the  reference  input.  Let  a,(fc  — 1)  and  bi{k  — 1)  be  the  estimates  of 
di  and  b{  at  time  k — 1,  respectively.  Then,  in  indirect  approach,  the  error  equation 
(called  prediction  error  equation)  is  written  by 

e{k)  = y{k)-(t>'(k-l)6(k-l)  (2.20) 

where 

<f>\k  — 1)  = [ y(k  — 1)  • • • y(k  — r)  u(k  — 1)  • • • u(k  — r)  ] (2.21) 

6\k  - 1)  = [ai(Jfc  - 1)  • • • ar(k  - 1)  W(k  - 1)  • • • br(k  - 1)  ].  (2.22) 

The  equation  (2.20)  is  used  to  calculate  6(k)  (see  the  next  section  for  estimation 
algorithms).  Now,  A{q~l)  and  B(q~ 1)  with  coefficients  replaced  by  estimated  pa- 
rameters may  not  be  coprime,  which  will  make  it  difficult  to  calculate  control  inputs. 
To  ensure  the  coprimeness  of  the  estimated  A(q~x)  and  F(g_1),  several  methods  ex- 
plained in  Chapter  One  (for  example  a projection  scheme)  can  be  used. 
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2.4  Basic  Estimation  Algorithms 

When  e(k)  denotes  the  error  equation  (2.11)  or  (2.20),  The  following  2 algo- 
rithms are  commonly  used  to  calculate  6(k)  in  adaptive  control  algorithms  [11,  31]: 

Gradient  Algorithm. 


6(k)  = 6(k-  1)  + 


(j>(k  — l)e(£) 
c + <f>'(k  — 1 )(f>(k  — 1) 


; c > 0. 


(2.23) 


Least  Squares  Algorithm. 


6(k)  = 6(k  - 1)  + P(k)4>(k  - l)e(fc)  (2.24) 


P{k)  = P(k-  1) 


P(k  - l)<f>(k  - l)^(Jb  - 1 )P(k  - 1) 
1 + (f)'(k  — l)P(k  — l)<f>(k  — 1) 


P( 0)  = P'(0)  > 0. 


(2.25) 


2.5  Samson  and  Fuchs’  Work 

In  this  section,  we  briefly  review  Samson  and  Fuchs’  Work  [7]  which  is  the  basis 
of  this  dissertation.  Consider  a SISO  LTI  plant  described  by 

y(k)  = ^2  a«  y(k  - 0 + H ~ *)  (2-26) 

«=i  «=i 

where  u(k),  y(k)  denote  the  scalar  input  and  output,  respectively,  p and  q denote 
fixed  positive  integers,  and  a,,  bt  are  constant  coefficients.  We  need  following  as- 
sumptions: 

Al.  n = max{p,  q}  is  known. 

A2.  The  plant  is  stabilizable. 
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The  equation  (2.26)  can  be  written  as 

y(Jfe)  = - 1)9  (2.27) 

where 

< i>'(k  — 1)  = [ y{k  — 1)  • • • y{k  — n ) u(k  — 1)  • • ■ u(k  — n ) ] (2.28) 

9'  = [a,  •••  an  bi  •••  bn].  (2.29) 

Note  that  a,-  = 0 if  i > p and  6,-  = 0 if  i > q.  Then,  the  prediction  error  equation  is 
given  by 

e(k)  = y(k)-(f>,(k-l)6(k-l)  (2.30) 

where  <f)(k)  is  as  in  (2.28)  and  9(k  — 1)  is  the  parameter  estimate  at  time  k — 1 
described  as 

0'(k  — 1)  = [ a\(k  - 1)  ...  an(k  — 1)  bi(k  - 1)  • • • bn(k  — 1)  ].  (2.31) 

To  obtain  the  parameter  estimate  9(k),  a class  of  estimation  algorithms  which 
satisfy  the  following  properties  are  considered: 

PI.  |e(A:)|  < ci(k)\\(j)(k  — 1)||  + c2(fc)  (2.32) 

where  (ci(A:)}  and  (c2(fc)}  are  uniformly  bounded, 

c\(k ) —*  0,  c2(A:)  — > 0,  as  k — > oo. 

P2.  ||0(jb)||  < M VJfc,  for  some  M.  (2.33) 

P3.  9(k)  converges  to  a constant  vector.  (2.34) 

The  least  squares  algorithm  given  in  Section  2.4  satisfies  all  of  these  properties  for 
the  plant  model  (2.27)  (see,  for  details,  [8]). 
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Let  x(k ) denote  the  estimated  state  vector  of  the  plant.  Then  a state  observer 
can  be  constructed  as 


where 


x(k  + 1)  = F(6(k))x{k)  + G(9(k))u(k)  + M(k)v(k) 


(2.35) 


v(k ) = y(k)  — Hx(k ) 


(2.36) 


nm)  = 


a\ (k)  1 0 • • • 0 

a2(k)  0 1 • • • 0 

an-i(k)  0 0 • • • 1 

an(k ) 0 0 • • • 0 


(2.37) 


GW))  = 


h(k) 

h{k) 

bn— i {.k) 

bn(k) 


(2.38) 


77  = [ 1 0 • • • 0 ]. 


(2.39) 


M{k)  is  selected  such  that  all  of  the  observer  poles  are  located  at  the  origin,  i.e., 


M'{k)  = [ai(k)a2(k)  ■■■  a„(&)]. 


(2.40) 


With  this  setup,  we  have  the  following  theorem  from  [7]. 
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Theorem  2.1.  With  the  parameter  estimation  algorithm  which  satisfies  3 prop- 
erties described  above,  with  any  control  law  L{k ) which  is  bounded  and  makes  the 
system 

F(0(k))  - G(6(k))L(k) 

exponentially  stable,  the  feedback  controller 

u(k)  = —L{k)x(k) 

makes  the  resulting  closed-loop  system  globally  stable. 

2.6  Robustness  to  Unmodelled  Dynamics 

In  the  early  1980s,  it  was  argued  that  existing  adaptive  control  algorithms 
could  not  be  used  with  confidence  in  practical  designs  where  the  plant  contained 
unmodelled  dynamics  [6].  The  problem  is  that  external  disturbances  or  unmodelled 
dynamics  can  cause  the  estimated  parameters  to  grow  without  bound  [6].  Since 
then,  considerable  research  has  been  done  to  get  better  understanding  about  the 
robustness  of  adaptive  control  algorithms.  The  following  simple  example  shows  how 
the  estimated  parameters  can  diverge. 

Consider  the  SISO  plant  model 

y{k)  = <f>'(k-l)0  + 6(k)  (2.41) 

with  bounded  external  disturbances  6(k).  Using  the  prediction  error  e(k)  given  in 
(2.30)  and  the  least  squares  algorithm  (2.24)  we  can  get  (for  details,  see  [32]) 

*(*)  = «+(£ (0  + r-'(0))-‘(£ 4>(i)S(i  + 1)  + />-’(O)(0(O)  - *)).  (2.42) 

i=0  i= 0 
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Let  4>(i)  = 1 /(*  + 1),  P( 0)  = 1 and  <5(i)  = 1.  Then  since 

+ P-'  (0)  -*  £ + 1 

feS  6 

and 


t- 1 

^<£(i)6(i  + l)  -»  oo 
i=0 

0(A:)  diverges  (see  the  Riemann  zeta  function  for  the  value  of  SiSi  i-2)- 


2.7  Time- Varying  Plants 


It  is  well  known  that  a linear  time- varying  system 

x(k  + 1)  = A(k)x(k) 


may  not  be  stable,  even  though  all  eigenvalues  of  A(k)  have  magnitudes  less  than 
one.  The  following  is  an  example. 

Example  2.1.  Define 


odd(k)  = < 


1 

0 


if  k is  odd 
otherwise 


and  consider  the  plant 


y(k  + 1)  = A(k)y(k)  + Bu(k)  (2.43) 

where 


m = 

0 2 odd{k ) 

, B(k)  = 

1 

2 odd(k  + 1)  0 

1 

All  eigenvalues  of  A(k)  are  zero  at  any  k,  but  this  plant  is  unstable.  It  implies 
that  this  plant  is  not  slowly  time- varying  (see  [33]  for  details  of  slowly  time- varying 
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systems)  and  any  controller  based  on  the  frozen  coefficients  will  fail  to  make  this 
plant  stable.  Therefore,  any  adaptive  controller  based  on  the  frozen  coefficients  will 
also  fail  to  work  for  the  plant  with  parameter  variations  like  this  example.  The  use 
of  the  structure  of  parameter  variations  is  a possible  solution  for  this  problem,  which 
is  one  of  main  features  of  the  proposed  adaptive  control  algorithm. 

2.8  Choice  of  Indirect  Approach  over  Direct  Approach 

As  mentioned  in  Section  2.1,  direct  approaches  require  a strong  assumption  that 
the  plant  to  be  controlled  is  stably  invertible,  while  this  assumption  is  not  necessary 
for  indirect  approaches.  As  far  as  we  know,  there  are  no  theories  which  support 
the  stability  of  direct  adaptive  control  for  time- varying  plants  which  are  not  slowly 
time- varying  in  the  presence  of  unmodelled  plant  uncertainties.  We  therefore  take 
an  indirect  approach. 


CHAPTER  THREE 

PLANT  MODEL  AND  THE  CONTROL  OBJECTIVE 


3.1  Plant  Model  and  Assumptions 

We  consider  a single  input  m-output  discrete  linear  time- varying  plant  described 

by 

y(k)  = Ai{k)y(k  ~ 0 + Yh  Bi(k)u(k  - i)  + d(k)  (3.1) 

t=i  i=i 

where  u(k),  y(k ) denote  the  scalar  input  and  m-dimensional  output,  respectively, 
d(k ) denotes  the  external  disturbance,  p and  s denote  fixed  positive  integers,  and 
Ai(k ),  B{(k ) are  time- varying  matrices  which  have  proper  dimensions. 

Assumption  1 : The  external  disturbance  d(k)  is  bounded,  i.e., 

sup{||d(fc)||}  < W\  for  some  W\.  (3-2) 


Entries  of  A{(k)  and  B{(k)  are  assumed  to  be  linear  combination  of  certain  known 
functions,  i.e.,  the  structure  of  the  plant  parameter  variations  are  known  a priori. 
We  will  call  this  type  of  parameter  variations  Structured  Parameter  Variations  as 
Tsakalis  and  Ioannou  [5].  In  practice,  there  may  be  errors  in  modelling  A{(k)  and 
B{(k)  by  structured  parameter  variations.  Therefore,  we  can  write  A,(fc)  and  B{(k) 
in  the  form 

Ai{k)  = J2(Aij  + AAij(k))f:j(k)  for  i = l,...,p  (3.3) 

j= i 

Bi(k)  = £(5,1  + A Bij(k))fj(k ) for  i = l,...,s  (3.4) 

j= i 
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where  A,j  and  Bt]  are  constant  matrices  (in  practice,  they  are  allowed  to  be  bounded 
slowly  time-varying  matrices  if  the  estimation  algorithm  can  track  time-varying 
parameters),  r is  a fixed  positive  integer,  fj(k)  are  known  functions  of  k which  are 
linearly  independent  and  bounded,  for  j = and  A Aij(k)  and  ABij(k)  are 

terms  due  to  modelling  errors.  We  will  call  A A,j(fc)  and  A Bij(k)  Unmodelled  Plant 
Uncertainties  or  simply  Plant  Uncertainties. 

Now  we  define  an  ( mp-\-s)r  x m parameter  matrix  9 , an  ( mp-\-s)r  x 1 regression 
vector  <f>(k)  and  an  ( mp  + s)r  x m plant  uncertainty  matrix  6(k)  by 

9'  — [An  • • • A\T  ■ • ■ Ap i • • • AprBn  ■ ■ ■ Bsr\  (3-5) 

<t>'{k  — 1)  - [h(k)y'(k-l)--- fT(k)y\k -!)■■■  Mk)y'(k-p)--- 

fT(k)y'{k  - p ) fi(k)u(k  - 1)  • • • fr(k)u(k  -l)--- 
f1(k)u(k~s)..-fr(k)u(k-s)]  (3.6) 

S\k)  = [AAu(fc)  • • • AAiT(k)  ■ ■ ■ AApl(fc)  • • • AApr(k) 

ABu{k)---ABsr{k)]  (3.7) 

where  “ ' ” denotes  the  transpose  of  a matrix.  Then  (3.1)  can  be  written  in  a 
compact  form 

y{k)  = {9  + 6{k)Y<f>{k  - 1)  + d{k)  (3.8) 

In  the  presence  of  unmodelled  plant  uncertainties,  there  is  no  9 £ 7^(mp+s)rxm 
which  satisfies 

y(k)  = 9'<j)(k-  1)  + d(k). 

Thus,  we  need  to  define  a tuned  parameter  matrix  which  will  be  regarded  as  a 
reference  for  the  estimation  instead  of  the  true  parameters.  We  define  a tuned 
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parameter  matrix  9*  by  a constant  matrix  satisfying 

\\S(k)\\  due  to  r < \\S(k)\\  due  to  any  0 G ft(mp+s)rXm. 

where  ||  • ||  denotes  the  Euclidean  norm.  From  now  on,  6(k)  due  to  9 * will  be  denoted 

by  S*(k).  Note  that  it  is  not  necessary  for  the  definition  of  the  tuned  parameters  to 

be  unique.  See  [34]  for  a different  definition. 

Assumption  2:  Positive  integers  p,s,r  are  assumed  known.  Moreover,  p < n, 
where  n is  the  plant  order. 

Assumption  3:  The  tuned  parameter  matrix  9*  satisfies 

sup{||^*(A:)||}  < D\  for  some  D\  > 0.  (3-9) 

To  construct  a state  feedback  control  law,  we  need  a state  model  of  (3.1).  We 
will  take  an  observable  realization  for  this  purpose.  We  first  consider  the  case  that 
p > s.  When  we  define  n as 

n = mp  (3.10) 

an  observable  realization  [35,  36]  of  (3.1)  is  given  by 

x(k  + 1)  = F{k)x{k)  + G{k)u(k)  + £(k  + l)  (3.11) 

y(k)  = Hx(k)  (3.12) 

where  an  n x n matrix  F(k),  an  n x 1 matrix  G(k)  and  an  m x n matrix  H are  in 


the  form 
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m = 


^(Jb  + l)  I 0---0 

Bi(k  + 1) 

A2(k  + 2)  0 /•••  0 

B2(k  + 2) 

| 

, G(k)  = 

: 

Ap^k  + p-l)  0 0 •••/ 

i 

's- 

+ 

1 

t—1 

Ap(k  + p)  0 0---0 

Bp(k  + p) 

H = [70---0  ] 


and 


m = h'  d(k). 


(3.13) 


Note  that  if  * > s Bi(k  + i)  = 0. 

Now  consider  the  case  that  p < s.  If  we  define  n as 


n — ms 


and  take  an  observable  realization,  the  resulting  state  model  will  not  satisfy  the 
reachability  condition  because  the  order  of  a reachable  realization  should  not  be 
greater  than  mp.  This  leads  to  the  following  lemma  which  gives  a realization  with 
the  order  mp.  For  convenience  of  notation,  the  summation  will  be  taken  as 

zero  if  < t0. 

Lemma  3.1.  Suppose  that  for  q < p, 

p p+? 

y(k)  = £ Mk)y(k  - i)  + E Bi(k)u(k  -i)  + d(k)  (3.14) 

t=i  i=i 

and  Ap(k)  is  invertible  for  any  k.  Then  there  is  a realization  of  (3.14)  described  by 
x(fc+l)  = F(k)x(k)  + G(k)u(k)  + £(k  + 1)  (3.15) 

y(k)  = Hx(k) + ^2  Di(k)u(k  - i) 

t=i 


(3.16) 
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where  an  n x n matrix  F(k),  an  n x 1 matrix  G{k ) and  an  m x n matrix  H are  in 
the  form 


A\{k  + 1) 

I 

0-- 

•0 

Gx{k+  1) 

A2{k  + 2) 

0 

I- 

•0 

G2(k  + 2) 

; 

, G(k)  = 

; 

Ap^i(k  + p-  1) 

0 

0- 

• / 

Gp-i(k  + p — 1) 

Ap(k  + p) 

0 

0- 

•0 

Gp(kAp) 

H = [ / 0 - - - 0 ] 

and 

£(fc)  = H'  d{k) 

Di(k  — p)  = -A-* l(k){Bp+i{k)  + J2Ap-j(k)Di+j(k  -p  + j)) 

j= i 

for  i = 1, . . . , q (3-17) 

Gi(k)  - Bi(k)  - Di(k)  + J2Aj{k)Di_:(k  - j) 

i= i 

for  i = 1,  • • • ,p,  Di(k)  = 0 if  i > q.  (3.18) 

Proof(Outline).  Using  (3.15)  and  (3.16),  express  ?/(&)  in  terms  of  y(k  — i ), 

i = l,...,p.  Equating  this  result  to  (3.14)  gives  (3.17)  and  (3.18).  ■ 

Remark  3.1.  In  Lemma  3.1,  the  constraint  that  q < p was  used  just  because  it 
was  easy  to  handle.  The  case  where  q > p can  be  treated  in  a similar  way. 

Using  this  lemma,  we  can  find  F(k),  G(k)  and  D{(k),  i = 1 ,...,<?  from  Ai(k), 

i = 1,  ...,p  and  Bi(k ),  * = 1,  ...,p  -f  q.  First,  we  find  Dq(k  — p)  from  (3.17),  i.e., 


D,(k  - p)  = -A;l(k)Br+,(k). 


25 


Since  we  know  the  structure  of  the  parameter  variations,  Dq(k  — p + 1)  follows  from 
Dq(k  — p).  Then,  using  Dq(k  — p -f  1),  we  obtain  Dq_\{k  — p)  from  (3.17).  Similarly, 
Di{k  — p),i  = 1,  ...,<7  — 2 can  be  obtained.  Now,  the  simple  case  where  m = 1 is 
given  in  the  following  corollary.  This  corollary  is  not  obtained  directly  from  Lemma 
3.1,  because  q ^ p,  but  can  be  easily  proved  in  a similar  way  to  Lemma  3.1. 
Corollary.  Suppose  that 

y(k)  = A(k)y(k  - 1)  T Bi(k)u(k  - i ) + d(k ) (3.19) 

i=i 

and  A(k ) is  invertible  for  any  k.  Then  there  is  a transformation 

9 

x(k)  — y(k)  — T;  Dj(k)u(k  — i)  (3.20) 

i=i 

such  that 

x(k  + 1)  = A(k  + l)x(k)  + B(k  + l)u(A:)  + £(&  + 1)  (3.21) 

D{(k)  = —A  1(k  + l)(5i+i(^  + 1)  — D{+i(k  + 1)) 

for  i — 1, ...,  q , D{(k  + 1)  = 0 if  i > q (3.22) 

B(k)  = B1(k)-D1(k ) (3.23) 

((k)  = d(k). 

From  Lemma  3.1,  we  see  that  the  realization  (3.15)-(3.16)  for  p < s has  the  same 
form  as  (3. 11)- (3. 12)  for  p > s except  some  extra  terms  in  the  output  equation. 

Let  a pair  ( F(k ),  G(k))  be  a system  described  as 

x(k  + l)  = F(k)x(k)  + G{k)u(k)  (3.24) 

and  let  <!>(&,  i)  denote  the  transition  matrix  of  (3.24),  i.e., 


$(jfc,i)  = F(k  - l)F(k  -2)...F(i)  for  k > *;  $(i,t)  = I. 
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For  any  positive  integer  N,  let  Yjv(k)  denote  the  ,/V-step  reachability  grammian 
defined  by 

k+N- 1 

YN(k)  = Y,  ®(k  + N,i  + l)G(i)G'(i)<f>'(k  + N,i  + 1).  (3.25) 

«=Jt 

Then  the  system  (F(k),  G(k))  is  called  uniformly  reachable  in  N steps  if 

Ys(k)  > r\I  for  some  rq  > 0 VA:  G Z+ . (3.26) 

where  Z+  denotes  a set  of  nonnegative  integers. 

Remark  3.2.  The  definition  of  uniform  reachability  is  due  to  [37]. 

Remark  3.3.  If  every  element  of  F(k),G(k ) is  bounded,  tr(Yjy(A;))  is  also 
bounded,  where  “tr”  denotes  the  trace  operation.  Thus  (3.26)  holds  if  and  only 
if  there  is  r2  > 0 such  that  | det(y/v(A’))  | > r2  VA;  G Z+ . 

Now  we  require  the  following. 

Assumption  \\  The  observable  realization  of  (3.1)  described  by  (3.1 1)-(3.12)  or 
(3.15)-(3.16)  is  uniformly  reachable  in  N steps  with  N >n. 

Remark  3.4.  The  observable  realization  is  always  uniformly  observable  in  n 
steps.  We,  therefore,  have  a realization  which  is  both  uniformly  reachable  (in  N 
steps)  and  uniformly  observable  (in  n steps). 

The  assumption  4 seems  too  restrictive.  In  fact,  an  observable  realization  may 
not  be  reachable  at  all  for  certain  systems.  Nevertheless,  we  can  reasonably  treat 
more  than  single  output  systems  (m  = 1)  and  all-state  measurable  systems  (p  = 1) 
with  this  assumption.  In  order  to  treat  more  general  cases,  we  can  invoke  the 
realization  theories  of  linear  time-varying  systems,  e.g.,  [38].  However,  algorithms 
are  generally  too  complicated  to  be  used  on-line,  so  we  will  not  treat  them  in  this 
dissertation. 


27 


In  most  cases,  we  have  prior  knowledge  about  the  unknown  parameter  matrix 
9*.  Thus,  we  assume  that  the  ij-th  element  of  0 * denoted  by  0tJ  belongs  to  the 
known  interval  [ 9™n  9^ax  ] for  all  i,j  [12,  13].  When  we  define  the  set  f l of  all 
possible  values  of  the  unknown  parameter  matrix  9*  by 

ft  = { 9 € 7^n+s)rxm  : 9ij  G [ 9%in  0%ax}}  (3.27) 

this  can  be  written  as  the  following. 

Assumption  5:  The  tuned  parameter  matrix  9*  G ft. 

Note  that  the  set  ft  is  compact,  thus  closed  and  bounded. 

Now  consider 

y(k)  = 9'(f>(k  — 1)  for  any  9 G ft.  (3.28) 

To  ensure  the  reachability  of  the  estimated  model,  we  make  the  following  assump- 
tion. 

Assumption  6 : There  is  e > 0 such  that  for  any  9 G fi,  any  k € Z+  an 
observable  realization  of  (3.28)  satisfies 

|det(Y}v(0,  fc))|  > e (3.29) 

where  Yn(9,  k ) is  its  reachability  grammian. 

When  p < s,  as  can  be  expected  in  Lemma  3.1,  we  make  the  following  additional 
assumption. 

Assumption  7:  If  p < s , then  there  is  £\  > 0 such  that 
|det(j4p(0, k))\  > £i  \/9e  ft,  Vk  G 2+ 

where  Ap(9,  k ) = £j=i  Apj  fj(k ) and  Apj,  j — 1, . . . , r are  submatrices  of  9'  for  any 
9 G ft. 
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3.2  Control  Objective 

Now  the  control  problem  is  to  find  a proper  adaptive  control  algorithm  which 
makes  the  closed-loop  system  robustly  globally  stable.  Robust  stability  means  that 
there  exists  D*  > 0 such  that  for  any  bounded  external  disturbances  d(k)  and  for 
all  possible  plant  uncertainties  6*(k ) which  satisfy 

suP{r(*)ii}  < d * 

the  closed-loop  system  is  stable  [17]. 


CHAPTER  FOUR 
PARAMETER  ESTIMATION 


In  this  chapter,  we  introduce  an  estmation  algorithm  for  9*  and  show  some 
properties  which  will  be  used  in  proving  the  closed-loop  stability.  This  estimation 
scheme  is  based  on  the  least  squares  algorithm  for  multi-output  systems  ( see  pp 
94-98,  in  Goodwin  and  Sin  [10])  but  includes  a forgetting  factor,  a dead  zone  and  a 
projection  operator.  In  addition,  all  signals  normalized  by  the  norm  of  the  regression 
vector  will  be  used. 

The  motivation  for  using  a forgetting  factor  is  to  allow  the  estimation  algorithm 
to  track  slowly  varying  parameters.  Conceptually,  this  implies  that  9*  is  allowed  to 
be  time-varying.  Let  9*(k)  denote  such  a time-varying  tuned  parameter  to  distin- 
guish it  from  the  constant  tuned  parameter  6*.  Then  sup  ||<5*(fc)||  due  to  9*{k ) will  be 
smaller  than  sup  ||<5*(A:)||  due  to  6*.  In  practice,  this  will  allow  us  to  have  a smaller 
dead  zone  in  the  estimation  algorithm.  For  certain  plant  models,  it  might  be  possi- 
ble to  track  ( 0 * -f  6*(k))  itself  with  small  errors  without  a dead  zone.  However,  in 
the  ensuing  analysis,  we  will  treat  only  the  constant  tuned  parameter  9*  because  of 
technical  difficulties. 

To  prevent  the  estimates  from  being  updated  in  the  wrong  direction  due  to  the 
disturbances  or  plant  uncertainties,  we  use  a dead  zone.  The  size  of  dead  zone  is 
determined  such  that  a Lyapunov  function  of  the  estimation  error  is  monotonically 
decreasing.  When  the  parameter  estimates  are  outside  of  fi,  we  project  them  onto 
the  boundary  of  H to  ensure  the  reachability  condition  (Assumption  6 in  Chapter 
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Three)  of  the  estimated  model.  A projection  operator  V will  be  used  for  this  purpose. 

The  sequence  of  the  algorithm  is  as  follows.  First,  we  normalize  signals.  Next, 
we  calculate  a forgetting  factor  A (k)  and  determine  the  value  of  a dead  zone  function 
represented  by  a(k ) whose  value  is  0 or  1.  After  that,  we  update  the  covariance 
matrix  P(k)  and  the  parameter  estimate  9(k).  Finally,  if  the  updated  parameter 
estimate  is  not  in  fl,  we  project  it  onto  fl. 

In  Section  4.1,  we  show  the  formulas  for  P(k)  and  6{k).  Next,  we  treat  a 
projecton  operator  in  Section  4.2,  a dead  zone  in  Section  4.3  and  a forgetting  factor 
A (k)  in  Section  4.4.  Finally,  we  show  some  properties  of  this  algorithm  in  Section  4.5. 

4.1  Estimation  Algorithm 

First,  we  define  the  prediction  error  e(k)  by 

e(k)  = y(k)-6\k-\)<i>(k-\)  (4.1) 

where  6(k  — 1)  is  the  estimate  of  the  tuned  parameter  matrix  6*  at  time  ( k — 1). 
Note  that  e{k)  is  not  a scalar  but  a vector  since  y(k ) is  a vector.  If  some  parameters 
are  known  a priori,  we  can  replace  (4.1)  by 

e(k)  = y(k)  - tp'(k  - 1 )(t>a{k  - 1)  - 0'(k  - 1 )<f>b{k  - 1) 

as  [13],  where  ip(k)  consists  of  the  known  parameters  and  cj)a(k ),  (t>b{k)  are  subvectors 
of  <{>{k). 

We  define  normalized  variables  for  </>(&),  y(k)  and  e(k)  as 

w(k)  = <f>(k)/n(k ) 

y(k)  = y(k)/n(k  — 1)  (4-2) 

e(£)  = e(k)/n(k  — 1) 
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where 


n 


(k)  = maxjl,  ||^(fc)||}. 


(4.3) 


Now  the  parameter  estimation  algorithm  is  given  by 


9(k)  = < 


v{0o(k)}  if  e0{k)<tn 

90(k)  otherwise 


(4.4) 


where 


90(k)  = 9{k  -1)  + a(k)P(k)w(k  - l)g'(Jb) 


(4,5) 


P(k)  - —ipik  - l)  - (4  6) 
(it)'  1 1 A(fc)  -f  p(k  — 1)  M 1 

P{ 0)  = P'(0)  > 0 


)u(A;  — 1)  = — 1)P(^  — 1 )w(k  — 1) 


(4.7) 


and  A (k)  is  a forgetting  factor  with  0 < A(A’)  < 1,  ct(k)  is  a function  which  has  a 
value  0 or  1 depending  on  the  dead  zone  and  V is  a projection  operator. 


4.2  Projection  Operator  V 

In  this  section,  we  develop  the  projection  operator  V used  in  the  parameter  es- 
timation algorithm  (4.4).  Suppose  that  90(k)  as  in  (4.5)  is  outside  of  fl,  so  we  want 
to  get  a substitute  6(k)  for  90(k)  such  that  9(k)  is  included  in  while  retaining  some 
important  properties  of  the  parameter  estimation  algorithm,  for  example,  proper- 
ties PI  - P3  in  Section  2.5  in  the  absence  of  the  plant  uncertainties  and  external 
disturbances.  When  we  choose  a Lyapunov  function  v(k)  as 


v{k)=  tr  {9’(k)P-\k)9(k)} 


(4.8) 
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with  the  estimation  error  9(k)  defined  by 

9(k)  = 9(k)  - 9*  (4.9) 

these  properties  are  all  retained  if 

v(k)  < va(k ) 

where 

v0(k)=  tr  {0o\k)P-'(k)Oo(k)}  (4.10) 

with 

90(k)  = 90(k)-9*.  (4.11) 

This  problem  leads  to  the  following  projection  method  [10,  11].  We  first  trans- 
form the  basis  for  the  parameter  space  by  the  linear  transformation  P~1'2(k).  Let 
p0{k)  and  Cl(k)  denote  the  images  of  90(k ) and  f l under  P~1/2(k).  Then,  we  can  find 
9(k)  by  orthogonally  projecting  each  column  vector  of  p0{k)  onto  the  boundary  of 
Cl  in  the  transformed  space.  We  will  describe  this  projection  algorithm  by  using  a 
projection  operator  V as 

V (80(k))  = 9(k).  (4.12) 

Now,  it  follows  from  the  definition  of  V that  for  9a(k)  £ 0 

v0(k)  - v(k)  > tr{(90(k)  - 8(k)),p-1(k)(90(k)  - 9{k))}.  (4.13) 

This  problem  can  be  reformulated  as  a problem  of  quadratic  programming,  i.e., 
to  find  a matrix  9(k)  minimizing  the  performance  index  J, t given  by 


Jk  = tr {(9(k)  - 90(k)yp-\k){9(k)  - 90(k))} 


(4.14) 
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such  that  0ij  E [ 9™n  9™ax  ] where  0{j  is  the  ij-th  element  of  0{k).  We  can  obtain  the 
solution  by  solving  a finite  number  of  equality  constraint  problems.  A disadvantage 
of  this  method  is  that  it  may  take  too  much  time  when  the  dimension  is  large.  For 
example,  if  9{k)  and  90(k ) are  n-dimensional  vectors  and  all  the  components  of  9a(k) 
are  out  of  bound,  then  at  worst  the  number  of  equality  constraint  problems  we  have 
to  solve  is 

nC\  + T1C2  + • • • + nCn- 2 + nCn-l 

where  pCq  is  the  9-th  binomial  coefficient  for  a p-th  order  polynomial.  An  alternative 
is  to  use  an  iterative  method  of  quadratic  programming.  The  choice  will  be  problem 
dependent. 


4.3  Dead  Zone 


In  this  section,  we  develop  the  dead  zone  function  a(k)  used  in  the  covariance 
update  equation  (4.6).  In  order  to  prevent  the  parameter  estimates  from  being 
updated  in  the  wrong  direction  due  to  the  external  disturbances  or  plant  uncertain- 
ties, we  use  a dead  zone.  The  size  of  dead  zone  is  selected  such  that  the  Lyapunov 
function  defined  as  (4.8)  is  always  monotonically  decreasing  in  spite  of  external 
disturbances  and  plant  uncertainties. 
a(k)  is  defined  by 


a(k)  = < 


1 ifA(*)<||S(*)|| 

0 otherwise 


(4.15) 


We  select  the  size  of  dead  zone  A (k)  as  follows.  Let  W and  and  D be  design 
parameters  which  satisfy 


VFi  < W and  Dx  < D 
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where  W\  and  D\  are  upper  bounds  in  the  norm  of  external  disturbances  and  plant 
uncertainties,  respectively,  as  in  Assumption  1 and  3 of  Chapter  Three.  With  these 
design  parameters,  for  any  sequence  of  forgetting  factors  {A(fc)}  with  0 < A (k)  < 1, 
we  define  A (k)  by 


A(A0  = 


~ l) 

A (*) 


w(k-l)\\D  + W/n(k-l)) 


(4.16) 


where  w(k  — 1),  n(k  — 1)  and  fi(k  — 1)  are  as  in  (4.2),  (4.3)  and  (4.7),  respectively. 
From  (4.5),  if  a(k)  is  equal  to  zero,  90(k)  is  not  updated  in  spite  of  a non-zero 
prediction  error  e(k ) (or  e(k)).  Thus  a(k ) can  be  regarded  as  a form  of  dead  zone 
function. 

As  in  Section  4.2,  we  define 


*(*) 

= 6(k)  - r 

(4.17) 

60(k) 

= 0o(k)-0*. 

(4.18) 

Then,  the  following  lemma  shows  that  a dead  zone  function  defined  above  makes 
the  Lyapunov  function  (4.8)  monotonically  decreasing. 

Lemma  4.1.  Consider  the  parameter  estimation  algorithm  (4.4)  - (4.7)  applied 
to  the  plant  (3.8)  with  the  function  a(k)  defined  in  (4.15)  where  D > D\  and 
W > Wi,  and  with  a projection  operator  V.  For  any  sequence  (A(&)}  such  that 
0 < \(k)  < 1,  the  function 

v(k)=  tr  {d'(k)p-\k)e(k)}  (4.19) 


is  nonnegative,  monotonically  decreasing  and  therefore  converges. 

Proof.  Clearly,  v(k)  > 0,  because  P(k)  > 0.  Since  a(k)  has  a value  0 or  1, 


a2(k)  = a(k). 


(4.20) 
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Postmultiplying  (4.6)  by  w(k  — l)a(fc)  and  using  (4.20),  we  get 

(4.21) 

Subtracting  9*  from  both  sides  of  (4.5)  and  using  (4.17)  and  (4.18),  we  have 

60(k)  = 9{k  - 1)  + a(k)P(k)w(k  - l)e'(k).  (4.22) 

Now  consider  (3.8)  with  9 and  8(k)  replaced  by  9*  and  8*(k ),  respectively.  Note 
that  9*  is  the  tuned  parameter  matrix  and  8*(k)  is  the  plant  uncertainty  matrix  for 
9*.  To  use  simple  notations,  we  will  regard  9 and  8(k)  as  9*  and  8*(k).  Inserting 
(3.8)  into  (4.1)  gives 

e(Jfc)  = -9\k  - 1 - 1))  + 6\k)4>{k  - 1)  + d(k).  (4.23) 

As  (4.2),  we  define  a normalized  variable  d(k ) for  d(k ) by 

d(k)=d(k)/n(k-l)  (4.24) 

where  n(k)  is  as  in  (4.3).  Then  it  follows  from  (4.23)  and  (4.24)  that 

e(k)  = — 9\k  — l)w(k  — 1)  + r/(k)  (4-25) 

where 

Tj(k)  = 8'(k)w(k  — 1)  + d(k).  (4.26) 

Combining  (4.22)  and  (4.25)  gives 

90(k)  = {/  - a(k)P(k)w(k  - l)w'(k  - 1 )}9{k  - 1) 


+ a(k)P(k)w(k  — l)rj'(k). 


(4.27) 
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On  the  other  hand,  postmultiplying  A (k)P  1 (k  — 1)  on  both  sides  of  (4.6),  and  using 
(4.21),  we  have 

A (^)JP(/b)JP-1(A:  - 1)  = / - a(k)P{k)w(k  - 1 )w'(k  - 1).  (4.28) 

Thus,  we  obtain  from  (4.27) 

90(k)  = X(k)P(k)P~1{k  - 1 )§(k  - 1)  + a(k)P(k)w(k  - l)i j\k)  (4.29) 

which  gives 

P-^k^k)  = X(k)P~1(k  - 1 )9{k  - 1)  + a{k)w(k  - l)r /(Jfe).  (4.30) 

Let 


v0(k)  = tr {0'o(k)P  *( k)90(k )}. 


(4.31) 


Inserting  (4.22)  and  (4.30)  into  (4.31),  we  get 


v0(k)  — A (k)v(k  — 1)  + a(k)  tr {9\k  — 1 )w(k  — l)?/(&)} 

, „n^X(k)  tv{e(k)w'(k-  l)§(k-  1)}  + /x(k-  1)  tr{e(fc)»/'(fc)}  00, 

+ “w 1 WTmT-T) (4'32) 

where  (4.7)  and  (4.21)  were  used.  Using  (4.25),  we  obtain  from  (4.32) 


v0(k)  — X(k)v(k  — 1)  = —a(k)H(k) 


(4.33) 


where 


H(k)  = 


A(fc)l|e(fc)||2 

\(k)  + fi{k-l) 


(4.34) 


Noting  that  9(k)  = 90(k)  if  9a(k)  6 0,  we  further  obtain  from  (4.13)  and  (4.33) 


v{k)  - X (k)v(k  - 1)  < -a{k)H(k) 


- tr {(90(k)  - ewyp-'ikWM  - 9(k))}. 


(4.35) 
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Since  sup{||d(fc)||}  < W and  sup{||<5(fc)||}  < D by  hypothesis  and 
d(k)/n(k  — 1)  from  (4.24),  the  equation  (4.26)  gives 


N*)ll  < \\w{k-l)\\D  + W/n(k-l). 


Therefore, 


H*(k)  < H(k) 


where 


= AW+'g-l)  ' (ll“(*  “ 1)1175  + ^ - 1)>2- 


On  the  other  hand,  from  (4.16)  and  (4.37),  we  have 


H*(k)  = 


A (k) 


X(k)  + fi(k-l) 

It  then  follows  from  the  definition  of  a(k) 


i(k)r-A(ky). 


0 < a(k)H*(k)  Vk 


which,  together  with  (4.36),  gives 


0 < a(k)H*(k)  < a(k)H(k). 


Therefore,  we  have  from  (4.35)  and  (4.40) 
v(k)  - \(k)v(k  - \)  < -a(k) H* (k)  -tr {(0o(k)  - 9(k))' P-\k)(0o(k) 

< 0 


d(k)  = 


(4.36) 

(4.37) 

(4.38) 

(4.39) 

(4.40) 

*(*))} 

(4.41) 


which  implies  that  v(k)  is  monotonically  decreasing.  Since  v(k)  > 0 for  any  Ar , it 
immediately  follows  that  v(k)  converges.  ■ 
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4.4  Forgetting  Factor  A(A’) 

In  order  for  the  estimation  algorithm  to  track  time-varying  parameters,  for- 
getting factors  are  used.  A simple  way  is  to  discount  the  old  data  exponentially. 
Suppose  that  a single-output  plant  model  is  given  by 

y(k)  = 0'<j)(k  - 1) 

and  thus  the  prediction  error  equation  is  written  as 

e(A)  = y(k)  — 6\k  — 1 )<f>(k  — 1) 

where  0(k)  is  the  parameter  estimate  at  time  k.  Then,  this  idea  is  formulated  as  a 
problem  to  minimize  the  following  performance  index: 

Jk  = {0  - e{Q))'\kP-l{0)(6  - 0(0))  + £ Afc-!(y(z)  - - l))2 

i=i 

where  A is  a constant  with  0 < A < 1.  The  solution  of  this  problem  leads  to  the 
standard  least  squares  algorithm  with  a constant  forgetting  factor.  This  simple 
exponential  discounting  works  well  if  the  plant  is  properly  excited.  However,  when 
the  excitation  is  not  sufficiently  rich,  this  forgetting  factor  may  cause  a problem 
called  “estimator  windup”  [30].  If  the  regression  vector  is  small  in  magnitude,  P{k) 
grows  exponentially.  Therefore,  the  estimator  gain  becomes  large  which  makes  the 
system  unstable.  Then,  the  plant  will  be  well  excited,  so  the  regression  vector  will 
become  small. 

To  avoid  estimator  windup,  many  kinds  of  variable  forgetting  factors  have  been 
suggested.  A simple  example  is  of  the  form  [32] 


\{k)  = a\(k  - 1)  + (1  - a) 
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with  0 < a < 1.  Another  example  is  the  constant  trace  algorithm  [16].  In  this 
algorithm,  forgetting  factors  are  selected  such  that  the  trace  of  the  covariance  matrix 
is  always  constant.  Ydstie  and  Sargent  [39]  proposed  a forgetting  factor  based 
on  the  Constant  information  principle.  The  constant  information  principle,  first 
introduced  by  Fortescue  [40],  is  an  assumption  that  at  each  step  the  amount  of 
forgetting  corresponds  to  the  amount  of  new  information  in  the  latest  measurement 
so  that  the  estimation  is  always  based  on  the  same  amount  of  information. 

We  will  follow  Ydstie  and  Sargent’s  constant  information  principle,  because  it 
is  based  on  the  reasonable  assumption  as  well  as  it  provides  some  nice  properties 
which  are  very  useful  to  prove  the  closed-loop  stability.  The  key  property  is  that  the 
infinite  product  of  forgetting  factors  is  uniformly  bounded  below  by  some  positive 
constant.  It  follows  from  this  property  that  the  covariance  matrix  P(k)  and  the 
estimates  are  uniformly  bounded  and  converge  [39].  We  will  further  extend  this 
concept  such  that  it  can  cover  the  plant  model  which  has  plant  uncertainties  or 
external  disturbances. 

We  now  briefly  review  Ydstie  and  Sargent’s  work.  The  presentation  here  will 
differ  from  that  of  Ydstie  and  Sargent  by  considering  multi-output  systems  with 
signal  normalization,  while  their  original  work  was  for  single  output  systems  without 
signal  normalization.  Consider  the  following  multi-output  plant  model  with  signals 
normalized  as  in  (4.2): 

y(k)  = 9'w{k  - 1).  (4.42) 

The  prediction  error  equation  is  then  given  by 


e(k)  = y(k)  — 9'(k  — 1 )w(k  — 1). 


(4.43) 
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Define  the  performance  index  Jk  as 


Jk  = tr{(0  - 0(O))V(O,  k)P~\0)(9  - 6(0))} 

+ £ *(*\  *)l|y(*)  - - i)ll2 

i=i 

where 


cr(i,  fc)  = A(i  + l)A(i  + 2)  • • • A (k)  for  i < k;  a(k,  k)  = 1. 

By  some  manipulation  of  (4.44),  we  obtain 

Jk  = tr{(0  - 6(k))'P-l(k)(6  - 0(Jfc))}  - tv{0(k)'P-\k)6(k))} 
+ <7(0,  k) tr{y/(0)JP“1(0)«(0))}  + £ a(i,  fc)||y(0||2 

t=i 

where 

k 

P_1(k)  = <r(0,  k)P~x( 0)  + ^2  cr(i,  k)w(i  — 1 )w'(i  — 1) 

t=i 

k 

6(k)  = P(fc){<7(O,k)P_1(O)0(O)  + ]T  <r(*,  k)w(i  - l)y'(i)}. 

1 = 1 

Equation  (4.47)  can  be  written  in  the  recursive  form 

P-'ik)  = X(k)P~1(k  - 1)  + w(k  - 1 )w'(k  - 1). 


On  the  other  hand,  using  (4.42)  we  have  from  (4.48) 


6(k)  = P(k)w(k  - l)y'(k)  + P(k)\(k)P-\k  - 1 )9(k  - 1). 


It  then  follows  from  (4.43)  and  (4.50)  that 


9(k)  = P(k)w(k  - l)(e(Jb)  + 9'(k  - l)w(k  - 1))' 


(4.44) 

(4.45) 

(4.46) 

(4.47) 

(4.48) 

(4.49) 

(4.50) 


+ A(k)P(k)P~1(k-l)9(k-l). 


(4.51) 
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P ostmultiplying  P{k)  on  both  sides  of  (4.49)  yields 

/ = A (k)P(k)P~x(k  - 1)  + P(k)w(k  - l)w'(k  - 1).  (4.52) 

Inserting  (4.52)  into  (4.51)  gives 


0(k)  = 0(k-l)  + P(k)w(k-l)e?(k).  (4.53) 

Equations  (4.49)  and  (4.53)  is  the  standard  least  squares  algorithm  with  a forgetting 
factor. 

Now,  let  N(k ) denote  the  value  of  J ^ for  6(k ).  Then,  we  have  from  (4.46) 

N(k)  = a(0,k)  tr{0'(O)P_1  (0)0(0)} 

+ 5>(*>*)  \\y(i)\\2  -HO’(k)P-\k)0(k)}  (4.54) 

«=i 

which,  together  with  (4.53),  gives 


N(k)  = \\y{k)\\2  - tr {9{k  - 1 )' P~\k)9(k  - 1)} 

— 2 w'(k  — 1 )&{k  — l)e(fc)  — ||e(fc)||2u/(fc  — 1 )P(k)w{k  — 1) 

+ A(fc)  (<7(0,  k - l)tr{0(O)'P_1  (0)0(0))}  + £ a(i,  k - 1)  ||y(i)||a 


.(4.55) 


Using  (4.43),  (4.49)  and  (4.54),  we  have  from  (4.55) 

N(k)  = X(k)N(k  - 1)  + || e(A;) ||2(  1 - w\k  - 1 )P{k)w{k  - 1)).  (4.56) 


By  using  the  matrix  inversion  lemma,  (4.49)  can  be  written  as 

r,ti\  n !\  P(k-l)w(k-l) 

Therefore,  noting  that  y(k)  = w'(k)P(k)w(k ),  we  get  from  (4.56)  the  recursive  form 
for  N(k) 


A(fc)||e(fc)||2 

A (k)  + fi(k  - 1)‘ 


N(k)  = X(k)N(k  — 1)  + 


(4.57) 
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We  regard  N(k)  as  a measure  of  information.  Then  setting  N(k)  = N( 0)  for 
some  ./V(O)  > 0 gives  a forgetting  factor  due  to  the  constant  information  principle. 
To  make  the  computation  of  A (k)  easier,  (4.57)  can  be  modified  to 

N(k)  = \(k)N(k  - 1)  + (4.58) 

In  practice,  the  resulting  algorithm  will  behave  almost  like  that  of  (4.57).  From 
(4.58),  we  see  that  setting  N(k)  — N( 0)  gives 

0 < A (A;)  <1  Vk  6 Z+  (4.59) 

where  Z+  is  the  set  of  nonnegative  integers.  Moreover,  Ydstie  and  Sargent  [39] 
showed  that  P{k)  and  9(k)  are  uniformly  bounded  and 

lim  A (k)  = 1 

k-*-oo 

when  the  algorithm  (4.49)  and  (4.53)  with  a forgetting  factor  formula  (4.57)  or  (4.58) 
is  applied  to  the  linear  time- invariant  plant  without  plant  uncertainties  or  external 
disturbances. 

However,  when  applied  to  the  plant  model  with  unmodelled  plant  uncertainties 
or  external  disturbances,  this  forgetting  factor  does  not  guarantee  the  boundedness 
of  P{k)  or  8(k).  Thus,  we  need  to  modify  the  measure  of  information  N(k). 

Roughly  speaking,  if  new  information  described  by  the  second  term  on  the 
right  side  of  (4.58)  is  presumably  bad,  we  neglect  it.  The  decision  is  made  by  the 
dead  zone,  i.e.,  if  ||e(fc)||  is  less  that  the  size  of  dead  zone  A (k)  at  time  k , the  new 
information  is  neglected. 

Based  on  this  idea,  we  modify  (4.58)  as 


N(k)  = A (k)N(k  - 1)  + S(H;{k)) ; N{ 0)  > 0 


(4.60) 
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where  the  function  S'(-)  is  defined  by 

S(x ) = < 


x if  x > 0 
0 otherwise 


and 


Hl(k)  = M1(k)X(k)  - M2(k) 


with 


Ma(fc)  = 


\e(kW 


1 + n(k  - 1) 

M2(k)  = (\\w(k  - 1)||D  + W/n(k  - l))2 


Now  we  choose  A (k)  such  that 


N{k)  = N{ 0)  \/k  G Z+ 


where  ^(0)  is  a design  parameter.  Then  (4.60)  gives 


iV(0)  = X(k)N(0)  + S(H*(k)). 


It  then  follows  from  (4.62)  and  (4.64)  that 


(4.61) 


(4.62) 


(4.63) 


(4.64) 


X(k)  = < 


(N{ 0)  + M2(k))/(N(0)  + Mi(k))  if  M2(k) / M\(k)  < 1 


otherwise 


which  always  satisfies  0 < A (k)  < 1. 


(4.65) 


4.5  Properties  of  the  Proposed  Estimation  Algorithm 

In  this  section,  some  important  properties  of  the  proposed  algorithm  will  be 
summarized  in  the  following  two  lemmas. 
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Lemma  4.2.  If  \(k)  is  chosen  such  that  it  satisfies  (4.64),  then 


i)  there  exists  a*  > 0 such  that 

a*  < <r(0,  Jfe)  Vfc  > 0 (4.66) 

where 

cr(f,  k ) = A (i  -f  l)A(i  + 2)  • • • A (k)  for  i < k;  a(k,  k)  = 1.  (4.67) 

ii)  A (k)  — > 1,  as  k — > oo.  (4.68) 

Proof.  Solving  (4.60)  recursively,  we  obtain 

k 

N(k)  = <7(0,  k)N( 0)  + J2  *(*\  k)S(H;(k)).  (4.69) 

«=i 

Now,  from  the  definition  of  a(k ) and  H*(k)  defined  in  (4.38),  we  have 

a(k)H*(k)  = S(H*(k)).  (4.70) 

This,  together  with  (4.41)  gives 

k 

v(k)  < a(0,k)v(0)  -J2°(iik)S(H*(i)) 

t=l 

- *(*>  fc)tr{(^0(i)  - e{i))'p-1{i)(e0(i)  - 0(f))}  (4.71) 

i=i 

Comparing  (4.62)  with  (4.37),  we  obtain 

H{{k)  < H*(k).  (4.72) 

Since  (4.72)  implies  that  S(H^(k))  < S(H*(k)),  it  follows  from  (4.71)  that 

k 

v(k)<  <t(0,  Jfe )»(0)  - Y,  <’■(*',  k)S(H;(k)). 


(4.73) 
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From  (4.69)  and  (4.73),  noting  that  N(k)  = N( 0),  we  have 


<r(0,  k)  > N(0)/(N(0)+v(0)) 


(4.74) 


which  implies  Property  i).  ii)  follows  immediately  from  i).  ■ 

Lemma  4.3.  For  the  parameter  estimation  algorithm  (4.4)  - (4.7)  applied  to 
the  plant  (3.8)  with  the  function  a(k ) given  in  (4.15)  where  sup{||d(fc)||}  < W 
and  sup{||<!)(fc)||}  < D and  with  the  forgetting  factor  given  in  (4.65),  the  following 
properties  hold: 


i)  P{k)  is  uniformly  bounded  and  converges. 


ii)  limsup{||e(fc)||  — K(\\w(k  — 1)117)  + W/n(k  — 1))}  < 0 (4.75) 

k 

where  K > (1  + HP(oo)H)1^2  and  lim  P(k ) = P(oo). 


iii)  6(k ) is  uniformly  bounded.  Furthermore, 


||0(&)  — 6(k  — l)|j  — *•  0,  as  k — * 


oo 


(4.76) 


iv)  The  size  of  dead  zone  A(&)  given  by  (4.16)  is  bounded. 


Proof.  By  the  matrix  inversion  lemma,  (4.6)  can  be  written  as 


p-'(k)  = X (k)P-\k  - 1)  + a(k)w(k  - l)w'(k  - 1) 


(4.77) 


which  gives 


p-\k)  = a(0,k)M-\k) 


(4.78) 


where 


(4.79) 
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From  (4.78),  we  have 

A mm(-P_1(£))  > cr(0,  A;)Amtn(p-1(0))  (4.80) 

where  A min(A)  denotes  the  minimum  eigenvalue  of  a matrix  A.  It  then  follows  from 
(4.74)  and  (4.80)  that 

11^)11  <(l  + »(0)/JV(0))||P(0)||  (4.81) 

which  implies  that  P(k)  is  uniformly  bounded.  On  the  other  hand,  it  is  already 
known  that  M(k)  converges  (see,  e.g.,  Samson  [8])  and  it  follows  from  Lemma  4.2 
i)  that  a sequence  {<r(0.  A:) } converges  and  the  limit  <r(0,  oo)  satisfies  <t(0,oo)  > a* 
for  some  cr*  > 0.  Thus  (4.78)  implies  that  P(k ) converges. 

To  show  ii),  we  have  from  (4.71) 

k 

v(k)  < <r(0,  k)v( 0)  S(H*(i)) 

»=i 

- **  £ tr{(tfo(0  - 6(i)yp-\i)(0o(i)  - 0(0)}  (4.82) 

t=i 

where  a*  is  as  in  (4.66).  Since  v(k)  converges  due  to  Lemma  4.1  and  for  any  i G 2+ 
S(H*(i ))  > 0 , tr{(0o(O  — 9(i))'P~1(i)(90(i)  — 0(0)}  — it  follows  from  (4.82)  that 

S(H*(i ))  — > 0,  as  i — » oo  (4.83) 

tr{(0o(O  — 0(i))'P-1(i)(0o(i)  — 0(0)}  ->  0,  as  i oo.  (4.84) 

Combining  (4.83)  and  (4.37)  gives 

limfcSUp{A(l)A+/0^-  1)  ~ ^W(<k  ~ + W!n(k  ~ ^ ~ °'  (4'85) 

Since  ||iy(A:)||  < 1 and  n{k ) > 1 for  any  k,  (||u;(k  — 1) ||ZA  + W/n(k  — 1))  is  bounded. 
Thus,  using  the  facts  that  n(k)  = w'(k)P(k)w(k)  < ||P(fc)||  and  \(k)  — ► 1 as 
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k — ► oo,  we  obtain  from  (4.85) 


limsup{||e(fc)||2  - (1  + HPMIDdHfc  - 1)||D  + W/n(k  - l))2}  < 0.  (4.86) 

k 

Now,  ii)  readily  follows  (4.86). 

Since  0(k)  is  always  in  Q and  Q is  bounded,  9(k)  is  uniformly  bounded.  To 
prove  that  ||0(fc)  — 9(k  — 1)||  converges  to  zero,  we  first  show  that 


a(k)P(k)w(k  — 1)  — >•  0,  as  k — > oo. 


(4.87) 


From  (4.6),  we  have 


Noting  that  P(k)  and  cr(0,i:)  converge  and  that  cr(0,i)  > a*  for  some  a*  > 0,  we 
have  from  (4.88) 


a(k) 


P(k  — 1 )w(k  — 1 )w\k  — 1 )P(k  — 1) 
\(k)  + n(k  - 1) 


0,  as  k — > oo. 


(4.89) 


Since  A (k)  — > 1 as  k — ► oo,  for  given  e with  0 < e < 1 there  exists  a positive  integer 
T such  that 


X(k)  > 1 — e for  k > T. 


Thus,  using  (4.20)  and  (4.21)  we  have 


a2(k)P(k)w(k  — 1 )w'(k  — 1 )P(k) 


= a(k) 


P(k  — 1 )w(k  — 1 )w'(k  — 1 )P(k  — 1) 


A(&)  + fi(k  - 1) 


A (k)  + /j,(k  - 1) 


~ [ ’ \(k)  + fi(k  — 1)  1 — £ 


(4.90) 


This,  together  with  (4.89),  yields  (4.87). 
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Since  9[k)  is  bounded,  ||ty(A:)||  < 1 and  8(k),  d(k)  are  bounded,  it  follows  from 
(4.25)  that  e(k ) is  also  bounded.  Therefore,  we  have  from  (4.5)  and  (4.87) 

90(k ) — 9{k  — 1)  — > 0 as  k — > oo.  (4.91) 

On  the  other  hand,  noting  that  P(k)  is  uniformly  bounded  and 

tr{(0o(fc)  - 0(k)yP~1(k)(Oo(k)  - 9(k))}  > 

tr {(90(k)  - 9{k))\90{k)  - 0(k))}/Xmax(P(k)) 

where  Amai;(A)  is  the  maximum  eigenvalue  of  a matrix  A,  we  have  from  (4.84) 

9(k)  — 90(k ) — > 0 as  k — > oo.  (4.92) 

Combining  (4.91)  and  (4.92)  gives  (4.76). 

Now,  we  show  iv).  Since  (4.2)  implies  that  ||u;(fc)||  < 1 and  P(k)  is  uniformly 
bounded,  fi(k)  given  by 

n(k)  = w'(k)P(k)w(k) 

is  uniformly  bounded.  On  the  other  hand,  Lemma  4.2  i)  implies  that  A (k)  is  bounded 
below  by  <7*  and  (4.3)  gives  n(k  — 1)  > 1.  Therefore,  it  follows  from  (4.16)  that 
A (k)  is  bounded.  ■ 

Note  that  the  boundedness  of  A (k)  is  due  to  the  signal  normalization  which 
makes  fi(k)  bounded.  Lemma  4.3  also  holds  for  the  estimation  algorithm  without 
projection.  The  proof  is  almost  same  as  Lemma  4.3  except  for  the  uniform  bounded- 
ness of  9{k).  The  uniform  boundedness  of  9(k)  for  the  algorithm  without  projection 
comes  from  the  fact  that  v(k)  as  defined  in  (4.19)  is  monotonically  decreasing.  Since 


v(k)  < v(0) 
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the  definition  of  v(k ) gives 

||0'(*)WII  W^_1(*))  < u(0).  (4.93) 

Using  (4.81),  we  obtain  from  (4.93) 

II0WWII  < t;(0)||P(0)||(l  + v(0)/N(0))  (4.94) 

which  implies  that  0(k)  is  uniformly  bounded. 

Remark  4.1.  The  equation  (4.76)  in  Lemma  4.3  does  not  imply  that  9(k)  is  a 
convergent  sequence.  For  example,  consider  a sequence  of  real  numbers 


x(k  + 1)  = x(k ) -f  v(k ) 


where 


{u(A:)}  = {1  -I 

1 V X 2 2 3 3 4 


1 


4 6 8 

Even  though  v(k)  converges  to  zero,  x(k)  does  not  converge. 


CHAPTER  FIVE 
ADAPTIVE  CONTROLLER 


In  this  chapter,  we  construct  a controller  which  makes  the  estimated  model 
exponentially  stable.  Basically,  we  follow  Samson  and  Fuchs’  idea  [7]  to  construct 
the  controller.  The  controller,  suggested  by  Samson  and  Fuchs,  consists  of  a state 
observer  for  the  plant  and  a state  feedback  control  law  for  the  estimated  model.  The 
observer  gain  was  selected  such  that  all  observer  poles  were  located  in  the  origin  and 
the  control  law  was  designed  such  that  it  made  the  estimated  model  exponentially 
stable.  With  this  controller  structure,  they  showed  that  the  closed-loop  system  for 
the  linear  time-invariant  plant  can  be  made  exponentially  stable,  in  the  absence  of 
the  plant  uncertainties  or  the  external  disturbances. 

We  will  construct  a controller  with  the  same  philosophy  as  Samson  and  Fuchs 
[7],  but  in  the  different  setting,  i.e.,  for  linear  time-varying  plants  which  may  be  fast 
time-varying  with  plant  uncertainties  and  external  disturbances.  Moreover,  we  will 
use  a different  state  feedback  control  law  based  on  Kleinman’s  method  (see  Kamen 
and  Khargonekar  [41]  for  the  linear  time-invariant  system,  and  Moore  and  Anderson 
[20]  or  Cheng  [42]  for  the  linear  time- varying  system)  to  make  the  estimated  model 
exponentially  stable.  Its  advantage  over  other  methods  such  as  the  LQ  (Linear 
Quadratic)  control  law  [43],  the  pole  placement  and  the  transfer  function  approach 
[36,  45],  is  ease  of  implementation,  especially  when  the  reachability  index  N is 
greater  than  the  system  order. 
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We  assume  that  Ap(k)  is  invertible  and  therefore  the  plant  model  with  p < s has 
a realization  with  the  same  order  as  the  plant  model  with  p > s (see  Section  3.1). 
Furthermore,  we  assume  that  p > s and  the  plant  model  of  the  form  (3.11)  and 
(3.12).  However,  the  argument  in  this  chapter  can  be  also  applied  to  the  plant 
model  with  p < s mutatis  mutandis. 

Consider  the  regression  model  (3.28).  Since  the  model  depends  on  9 in  H, 
its  observable  realization  can  be  viewed  as  a function  of  6.  Thus,  to  indicate  the 
dependency  of  9 , we  will  denote  a system  matrix  F(k)  and  an  input  matrix  G{k ) of 
(3.28)  by  F(9,  k)  and  G(9:  k ),  respectively.  Further,  substituting  9(k)  for  9 will  give 
the  observable  realization  ( F(9,k),G(9,k ))  of  the  estimated  model. 


5.1  Adaptive  State  Observer 


Let  an  n x 1 vector  x(k)  denote  the  estimated  state  vector  of  the  plant.  Then 
the  adaptive  state  observer  is  given  by 


x(k  + 1)  = F(6(k),  k)x{k)  + G(9(k),  k)u(k)  + M{k)v(k)  (5.1) 


where 


F(9(k),k) 


•j(k)  — y(k)  — Hx(k) 

Ai(#(fc),  k + 1) 

I 

0 ■■ 

• 0 

A2(9(k),k  + 2) 

0 

I 

0 

Ap_ i{9(k),  k + p — 1) 

0 

0 •• 

• I 

Ap(9(k),  k + p) 

0 

0 • 

■■  0 

(5.2) 


(5.3) 
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and 


G(0{k),k)  = 


B1(6{k),k+  1) 
B2(9(k),k  + 2) 


1) 

Bp(9(k),  k + p) 


(5.4) 


//■  = [/  0 - ■ - 0 ].  (5.5) 

Note  that  B{(9(k),  k+i)  = 0 if  i > s.  Associated  with  (3.3)-(3.4),  Ai(9(k),  k + i), 
i = 1,  ...,p  in  (5.3)  and  Bi(9(k ),  k + i),  i = 1, 5 in  (5.4)  can  be  written  as 

Ai(9(k),  k + i)  = jr  Aij(k)fj(k  + i),i  = 1,  ...,P  (5.6) 

j=  1 

5,-(0(fc),  fc  + 0 = £ Bij(k)fj(k  + *),*  = 1,  -,s  (5.7) 

j=i 

where  Aij(k)  and  Bij(k)  are  submatrices  of  $'(&)  satisfying 

0'(fc)  = [ An(fc)  • • • Alr(fc)  • • • Apl(fc)  • • • Apr (A:)  fln(fc)  • • ■ Bsr(k)  ].  (5.8) 

We  select  M(k)  such  that  all  of  the  observer  poles  are  located  in  the  origin,  i.e., 

M\k)  = [ A[(9(k),  k + 1)  A'2(9(k),  k + 2)  • • • A’p(0{k),  k + p)}.  (5.9) 

This  type  of  observer  has  been  frequently  used,  see  for  example,  Samson  [8],  Ossman 
and  Kamen  [13],  and  Kreisselmeir  [29]  in  the  more  general  form. 

5.2  Kleinman’s  Method 

When  a pair  ( F(k),G(k ))  is  uniformly  reachable  in  N steps,  the  feedback  con- 
trol law  according  to  Kleinman’s  method  [20]  is  given  by 


u(k)  = —L(k)x(k) 


(5.10) 
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where 

L{k)  = G'(k)&(k  + N + 1,  k + 1 )Yfil1(k)^(k  + N + 1,  k + 1 )F(k)  (5.11) 

and  YN(k)  is  the  iV-step  reachability  grammian  defined  in  (3.25).  Note  that  Y/v+i(k) 
was  used  instead  of  Y^(k).  The  validity  of  Kleinman’s  method  is  based  on  the 
following  theorem  [20]. 

Theorem  5.1.  With  (F(k),G(k))  uniformly  reachable  in  N steps,  (F(k),G(k)) 
bounded  above,  and  the  state  feedback  gain  L(k)  defined  in  (5.11),  the  closed  loop 
system 

x{k  + 1)  = (F(k)  - G{k)L(k))x(k) 

is  exponentially  stable. 

5.3  Properties  of  the  Estimated  Model  as  a Sequence 

Corresponding  to  the  notation  introduced  for  F(9,k)  and  G(9,k ),  we  will  use 
L(9,  k)  and  Yn+i(9,  k)  instead  of  L(k)  and  Y^+i(k)  to  indicate  the  dependency  on  9. 
It  is  useful  to  consider  two  families  {F(9,k)}  and  {G(9,k)}  of  continuous  functions 
on  where  each  of  F(9,k)  and  G(9,k)  with  a fixed  A:  is  a member.  Then  we  have 
the  following  lemma. 

Lemma  5.1.  { F(9 , k )}  and  {G(9,  fc)}  are  equicontinuous  and  uniformly  bounded 
on  f 2. 

Remark  5.1.  A family  F whose  member  is  a continuous  function  defined  on  f \ 
is  said  to  be  equicontinuous  on  f l if  for  every  e > 0 there  exists  8 > 0 such  that  for 
any  / E F ||/(x)  — f(y) ||  < e whenever  ||x  — ?/||  < 8,  x G and  y E fl  [46]. 
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Proof  of  Lemma  5.1.  It  suffices  to  prove  only  for  {G(9,  &)},  since  the  same 
argument  can  be  used  for  {F(9,k)}. 

Replacing  9(k ) in  (5.7)  by  9 defined  as  (3.5),  we  have 

r 

Bi(6 , k + i)  — ^ Bijfj(k  + i),  i = 1, ...,  s (5.12) 

j= i 

Then  (5.4)  gives 

l|G(M)||2  = twtBMk  + iW 

i=i  j= i 

< ±±  MW  + itf  (5.13) 

i=i  j= i 

Since  fj(k),  j — 1,  ...,r  are  bounded  functions,  there  is  M > 0 such  that 

\fJ(k)\<M  VkeZ+,j  = l,...,r.  (5.14) 

Let  A be  an  x n2  matrix.  Then  the  Frobenius  norm  [47]  ||A||f  of  A defined 
as 

/ ni  n2  \ 1//2 

MIIf-  EEl«,f 

\‘=1 3=1  / 

satisfies 

Mil  < Pllr  < M (5.15) 

where  atJ  is  the  ij-th  element  of  A and  ||A||  is  the  Euclidean  norm  of  A. 

Further,  we  define  a m x sr  submatrix  B of  9'  by 

B = [Bu--.Blr---Bpl---Bar]  (5.16) 

Using  (5.14),  (5.15)  and  (5.16),  we  obtain  from  (5.13) 

||G(M)|| 2 < M2m||jB||2 


< M2m\\9\\\ 


(5.17) 
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Since  ft  is  compact  (thus  bounded),  uniform  boundedness  of  { G(9 , k)}  immediately 
follows  from  (5.17). 

For  given  £ > 0,  let  8 = £/(My/rn).  Then,  from  the  definition  of  G(9,k)  and 
(5.17),  we  get 

\\G(x,k)-G(y,k)\\  = \\G(x-y,k)\\<e 

for  all  x and  y in  ft  for  which  ||x  — y\\  < 8,  which  implies  that  { G(9 , k )}  is  equicon- 
tinuous  on  f L ■ 

Lemma  5.2.  Suppose  that  {/n(x)  : H — *>  II,  n £ Z+j  is  equicontinuous  on  ft,  y> 
is  uniformly  continuous  on  LI,  and  hn(x)  = y?(fn(x)).  Then  {hn(x)}  is  equicontin- 
uous on  ft.  Moreover,  if  {/n(^)}  is  uniformly  bounded  on  ft,  then  {hn(x)}  is  also 
uniformly  bounded  on  ft. 

The  froof  of  Lemma  5.2  follows  directly  from  the  definitions. 

Lemma  5.3.  {Yiv+i(9,k)}  is  equicontinuous  and  uniformly  bounded  on  ft. 
Proof.  Define  gk{9)  on  ft  by 

gk{9)  - [ F{9,  k)---  F{9 , k + N-1)  G(9,  k)---  G{9 , k + N-  1)  ] 

and  choose  a compact  set  II  such  that 

gk{0)e  n for  9 eft,  kez+. 

Such  a compact  set  always  exists  because  {<7fc(0)}  is  uniformly  bounded  on  ft  by 
Lemma  5.1.  From  the  definition  of  YN+\(9,k)  as  in  (3.25),  we  see  that  any  element 
of  YN+1(9,k)  is  a uniformly  continuous  function  on  II.  Then  the  result  immediately 
follows  from  Lemma  5.2.  ■ 

Corollary.  { det(yjv+i(0,  Jfc))  } and  {a,d]{YN+l{9,k))}  are  equicontinuous  and 
uniformly  bounded  on  ft,  where  adj(Y/v+i(05  k))  is  the  adjoint  matrix  of 
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Lemma  5.4.  {V)v+i(^ &)}  is  equicontinuous  and  uniformly  bounded  on  Cl. 

Proof.  It  follows  from  Assumption  6 in  Section  3.1  that  YfJ^(9,  k)  always  exists 
for  any  k e Z+ , any  9 € Cl.  The  remainder  of  the  proof  follows  from  Lemma  5.2 
and  Corollary  of  the  Lemma  5.3.  ■ 

Corollary.  {L(9,  k)}  and  {F(9,k)  — G(9,k)L(9,k)}  are  equicontinuous  and 
uniformly  bounded  on  Cl. 

5.4  Closed- Loop  Stability  of  the  Estimated  Model 

In  this  section,  it  will  be  shown  that  we  can  construct  a feedback  control  law 
based  on  Kleinman’s  method  which  makes  the  estimated  model  exponentially  stable. 
Let  YN+1(6(k),---,9{k  + N),k)  and  L{9{k),  ■ ■ ■ ,9(k  + N),k)  be  the  {N  + l)-step 
reachability  grammian  and  Kleinman’s  feedback  gain,  respectively,  at  time  k for  the 
system  (F(9(k),k),G(9(k),k)).  Note  that  the  (N  + l)-step  reachability  grammian 
and  Kleinman’s  feedback  gain  at  time  k are  functions  of  9{k  + 1),  • • • , 9{k  + N)  as 
well  as  of  9(k). 

Since  we  do  not  know  the  future  parameter  estimates,  we  will  use  the  current 
estimate  9(k)  instead  of  9{k  + 1),  - • • ,9(k  + N)  to  calculate  the  (N  + l)-step  reach- 
ability grammian  and  Kleinman’s  feedback  gain  of  the  estimated  model  at  time 
k.  The  resulting  reachability  grammian  will  be  Yiv+i(9(k),  - ■ ■ ,9(k),k)  or  simply 
YN+i(9(k),  k)  which  comes  from  the  notation  YN+i{9,k)  already  introduced,  and 
Kleinman’s  feedback  gain  will  be  L(6(k),  • • • , 9(k),  k)  or  simply  L(9(k ),  k). 

Now,  consider  y)v+1(0i,  • • • , #yv+ 1,  k)  and  L(9 1,  • • ■ , 9^+ i,  k)  as  functions  defined 
on  ClN+1  V^i,  • • • , 0yv+i  6 Cl.  Then,  these  functions  are  also  equicontinuous  and 
uniformly  bounded  on  Cl  (which  can  be  proved  in  a similar  way  to  Lemma  5.3). 
Using  these  notations  and  facts,  we  have  the  following  theorem. 
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Theorem  5.2.  With  the  parameter  estimation  algorithm  introduced  in  Chapter 
Four,  let  9(k)  be  the  estimated  parameter  matrix  at  k- th  instant.  Then  there  exists 
a bounded  sequence  of  feedback  gains  {L(9(k),  k))}  such  that 

(F(9(k),k)-G(9(k),k)L(9(k),k)) 

is  exponentially  stable. 

Proof.  By  Corollary  of  Lemma  5.4,  { L(9 , k)}  obtained  from  Kleinman’s  method 
is  equicontinuous  and  uniformly  bounded  on  ft.  Since  9(k)  E ft  for  any  k E Z+ , it 
follows  that  {£(#(&),&)}  is  bounded. 

Now,  by  Assumption  6 in  Section  3.1,  there  is  e > 0 such  that 

\det(YN(9,k))\>eVk€Z+,  \/9  E ft.  (5.18) 

Moreover,  since  det(Vjv+1(0i,  ■ • • , #at+i,  k))  is  equicontinuous  on  ft  (which  can  be 
proved  in  a similar  way  to  Lemma  5.3),  for  given  t2  > 0 with  e2  < £ there  exists 
T E Z+  such  that  for  any  k > T,  for  any  9 6 fi 

|det(V:v(0,  *))|  - |det(y^+1(^(Ar),  • ■ • , 9(k  + N),  k))\  < e2.  (5.19) 

Combining  (5.18)  and  (5.19)  gives  for  any  k > T 

0 < e — e2  < \det(YN+1(9(k),  ■■■,9(k  + N),  A))| 

which  implies  that  the  estimated  model  is  uniformly  reachable  in  N steps  for  k > T. 

Therefore,  ( F(9(k),k ) — G(9(k),k))L(9(k),  • • • ,9(k  + N),k))  is  exponentially 
stable  and  thus  for  given  8 with  0 < 8 < 1,  there  exists  t E Z+  such  that 


||Jt(*)ll  < 8/2  \/k>T 


(5.20) 
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where 

jt(k)  = l[(F(0(k  + z),k  + i)- 

t=i 

G(9(k  + i),  k + i)L(9(k  + i),---,9(k  + N + i),k  + i)).  (5.21) 

Since  F(9,  k)  and  G(9 , k ) are  uniformly  bounded  on  f 1,  L(9(k),  • • • ,9(k  + N),  k)  is 
equicontinuous  on  fl,  and  ||0(fc)  — 9(k  — 1)||  converges  to  zero,  there  is  T*  G i?"1"  with 
T*  > T such  that  for  any  k >T* 

t 

\\Jt(k)-H(F{9(k  + i),k  + i) 

1 = 1 

- G{9(k  + 0,  lb  + i)£(0(*  + *)»  ^ + 0)11  < 6 /2-  (5-22) 


It  then  follows  from  (5.20)  and  (5.22)  that  for  any  k > T* 

t 

||  |Q(F(0(A;  + ?),  A;  + i)  — G(9(k  + i),k  + i)L(9(k  + i),  k + i)||  < 8. 

;=i 

which  completes  the  proof.  ■ 


5.5  Adaptive  Control  Law 

We  now  define  an  adaptive  control  law  by 

u(k)  = -L{9{k),  k)x(k)  + r(k)  (5.23) 

where  x(k)  is  the  estimated  state  vector  given  in  (5.1),  L(9(k),k)  is  the  feedback 
gain  obtained  from  Kleinman’s  method  and  r(k)  is  the  reference  signal.  Besides  the 
control  objective  mentioned  in  Section  3.2,  some  other  specifications  can  be  given. 
One  of  the  important  specifications  is  to  require  an  output,  say  yi(k ),  of  the  plant 
to  track  the  reference  signal  r(k)  so  that  the  track  error 


\yi(k)  -r(k)\ 
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is  suitably  small.  To  meet  this  specification,  we  can  use  the  internal  model  principle. 
The  design  procedure  is  as  follows.  First,  we  realize  the  internal  model  from  the 
dynamics  of  the  reference  signal  and  external  disturbances.  Next,  we  augment  the 
plant  model  with  the  internal  model.  Finally,  we  design  a control  law  such  that  the 
augmented  model  is  asymptotically  stable. 

Suppose  that  r(k)  is  a step  command  and 

Vi(k)  = h y(k) 

where  an  1 x m vector  h is  given  by 

h = [1  0 •••  0]. 

The  internal  model  is  then  realized  as 


z(k  + 1)  = z(k)  + (h  y(k)  - r(k)). 


(5.24) 


Combining  the  state  observer  (5.1)  - (5.5)  and  the  internal  model  (5.24)  gives 


x(k  + 1) 
z{k  + 1) 


Fa{9{k),k) 


x(k) 
z(k) 

+ E\r(k)  + E2(k)v(k) 


+ Ga(0(k),  k)u(k) 


(5.25) 


where 


Fa(9(k),k) 


F(0(k),  k)  OnXl 
h H 1 


Ei  = 


OnXl 

-1 


Ga(9(k),k) 


G(9(k),  k) 

0 


; E2(k) 


M(k) 

h 


and  OrXs  denotes  an  r x s zero  matrix.  The  augmented  estimated  model  is  then 
denoted  by  a pair  (Fa(9(k),  k),  Ga(9(k),  k)).  For  this  system,  Kleinman’s  method 
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gives  the  feedback  gain  La(9(k),k)  and  with  this  feedback  gain  we  construct  the 
following  control  law: 


u{k)  = — La(9(k),  k) 


(5.26) 


_ *(*) 

Suppose  that  the  augmented  estimated  model  given  in  (5.25)  satisfies  the  fol- 
lowing assumption. 

Assumption  6. a:  There  is  £ > 0 such  that  for  any  9 G where  Cl  is  as  defined 
in  (3.27),  the  augmented  model  (Fa(9,k),Ga(9,k))  satisfies 


\det(YN(9,k))\  >e  \/k£Z+ 


where  Yiv(9,k)  is  its  reachability  grammian,  for  some  N G Z + . 

Then,  Theorem  5.2  can  be  applied  to  this  augmented  estimated  model.  However, 
Assumption  6 does  not  imply  Assumption  6. a,  so  the  control  law  (5.26)  may  result 
in  the  instability  of  the  closed-loop  system  ( Fa(9(k ),  k ) — Ga(9(k ),  k)La(9(k),  k)).  To 
avoid  this,  we  suggest  the  following  two  approaches.  The  first  approach  is  to  take  a 
set  such  that  it  satisfies  Assumption  6. a in  this  section  instead  of  Assumption  6 in 
Section  3.1.  The  second  approach  is  to  switch  the  control  law  (5.26)  to  that  of  the 
estimated  model  without  augmentation  when  the  augmented  model  approaches  to  a 
system  which  is  not  reachable.  As  for  the  augmented  model  wuth  the  internal  model 
(5.24),  this  can  occur  when  a transmission  zero  of  the  estimated  model  approaches 
1. 

Remark  5.2.  The  transfer  function  theories  for  linear  time- varying  systems  are 
based  on  a non-commutative  ring  structure  called  the  skew  ring  of  polynomials. 
However,  poles  and  zeros  can  be  defined  in  a very  similar  manner  to  the  linear 
time-invariant  systems.  See  [36,  45]  for  the  transfer  function  approach  to  linear 
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time-varying  systems,  and  [48]  for  the  definition  of  poles  and  zeros  of  linear  time- 
varying  systems. 

If  control  laws  are  switched,  the  output  may  not  track  the  reference  signal  very 
well.  However,  since  the  estimated  model  satisfies  Assumption  6 in  Section  3.1 
stability  of  the  closed-loop  system  ( F(6(k ),  k ) — G(9(k ),  k)L(0(k),  k))  is  guaranteed. 
Although  the  internal  model  presented  in  this  example  is  a simple  one,  the  same 
principle  applies  in  more  general  cases. 


CHAPTER  SIX 
ROBUST  STABILITY 


In  this  chapter,  we  show  that  the  control  law  (5.23)  and  an  adaptive  state 
observer  (5.1),  all  based  on  the  latest  estimates  0(k),  make  the  closed-loop  system 
robustly  globally  stable.  To  prove  this,  we  need  the  following: 

• A bounded  sequence  of  feedback  gains  which  makes  the  estimated  model  ex- 
ponentially stable. 

• An  estimation  algorithm  in  which  ||#(&)  — 6(k  — 1)||  converges  to  zero  and 
Lemma  4.3  ii)  is  satisfied. 

In  fact,  for  any  adaptive  controller  we  can  use  the  arguments  of  this  chapter  to 
prove  the  robust  stability  of  the  closed-loop  system  if  the  above  two  features  hold. 
Therefore,  the  arguments  in  this  chapter  are  also  valid  for  the  control  law  based  on 
the  internal  model  principle,  e.g.,  (5.26)  mutatis  mutandis. 

As  defined  in  Section  3.2,  robust  stability  means  that  there  exists  D*  > 0 
such  that  for  any  bounded  external  disturbances  d(k)  and  for  all  possible  plant 
uncertainties  6*(k)  which  satisfy 

sup{||<r(fc)||}  < D * 

the  closed-loop  system  is  stable. 

Theorem  6.1.  There  exists  D*  > 0 such  that  for  any  D and  W which  are  design 
parameters  used  in  calculating  the  size  of  dead  zone  with  0 < D < D* , and  for  any 
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bounded  external  disturbances  d(k)  and  plant  uncertainties  S*(k)  satisfying 

supwwii}  < /> 
sup{||d(fc)||}  < W 

the  closed-loop  system  described  by  the  plant  (3.1),  the  parameter  estimation  al- 
gorithm (4.4)-(4.6),  the  state  observer  (5.1)  and  the  control  law  (5.23)  is  globally 
stable. 

Proof.  From  (5.1)-(5.5),  we  have 

v(k)  = y(k)  - t MW  ~ 0,  k ) v(k  - *)  - E MW  - *')»  k ) W ~ *)•  l6-1) 

i=l  i=l 

On  the  other  hand,  it  follows  from  (4.1)  together  with  (3.6)  and  (5.6)-(5.8)  that 
e(k)  = y{k)  - 6\k  - l)W  - 1) 

= y(k)  - MG(k  - !)» k)y(k  - 0 - 1 Bi{W  - !)>  k)W  - *)•  (6-2) 

t=i  i=i 

Then  (6.1)  and  (6.2)  gives 

v(k)  = c(Jfc)  + i2(Jfc  - l)^(fc  - 1)  (6-3) 

where 

R(k  — 1)  = [An(k  — 1)  — A\\(k  — 1)  • • • Air(k  — 1)  — A\r{k  — 1)  • • • 

Api ( k — 1)  — Api(k  — p)  ■ • ■ Apr(k  — 1)  Apr{k  — p ) 

Bn{k  - 1)  - £n(*  — !)•••  Bsr{k  - 1)  - £sr(fc  - 5)  ].  (6.4) 


Note  that  A,j(k)  or  Bij(k)  in  (6.4)  is  a submatrix  of  9'{k).  Define  ip{k)  as 

^\k)  = [y\k)y\k  - l)  ■ ■ ■ y'(k  - p + l)  u{k)  ■ ■ ■ u{k  - s + l)}. 


(6.5) 
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Then  (3.6)  gives 


<f>{k)  = N(k)xl>{k)  (6.6) 

where  the  elements  of  N(k)  are  either  zero  or  one  of  the  functions  /i(fc+l), ...,  fr{k  + 
1).  Since  fj(k),j  = are  bounded,  for  some  N*  > 0 


||JV(lfe)||  < N*  Vfc  G Z+. 

Since  R{k)  -*  0 as  k -*  oo  by  Lemma  4.3  iii)  and  n(k)  < 1 + U(k)\\  from  (4.3), 
Lemma  4.3  ii),  together  with  (6.3)  and  (6.6),  implies  that  for  given  e > 0 there  is 
T G Z+  such  that  for  any  k >T 


||u(fc)||  - ( I<D  + £)N*\\xp(k  - 1)||  - KW  < e(l  + Nm\\tl>(k  - 1)||).  (6.7) 


Now  using  the  state  observer  (5.2)  and  the  control  law  (5.23),  we  can  write 


(6.5)  as 


V>(fc)  = Sip{k  - 1)  + D(k)x(k)  + Ei(k) 


(6.8) 


where 


Omxm  ' ' ' Om'X.m  f^rnxl  ’ ’ ' OrnX.1 

OmXm 

Im(p-l)  : Om(p-i)xs 

Omxm 

Oixm  ’ ’ ' Oixm  0 ' ' ' ® 

0 

0(j-l)xmp  7s_i 


(6.9) 


0 
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m = 


H 

vik) 

0 

0 

0 

! Ex{k)  = 

0 

-L(0{k),k) 

r(7) 

0 

0 

0 

0 

(6.10) 


and  7r,  OrXS  denote  anrxr  identity  matrix  and  an  r x s zero  matrix,  respectively. 
Define 


z(k  + 1)  = [ tj){k)  x(A;  + l)].  (6-11) 

Inserting  (5.23)  into  (5.1)  and  combining  this  result  with  (6.8)  gives 

z(k  + 1)  = A{k)z(k)  + E{k)  (6.12) 


where 


A(k)  = 


S D(k) 

0 F{6(k),k)-G(6{k),k)L(6{k),k ) 


(6.13) 


m = 


Erik) 


(6.14) 


M{k)v(k)  + G(6(k),k)r(k) 

By  the  theorem  5.2,  L{0(k),k)  is  bounded  ( thus  D{k)  is  bounded  ) and 
{F(6{k),k)  - G(9(k),k)L{0(k),k))  is  exponentially  stable.  Moreover,  S'  is  a con- 
stant exponentially  stable  matrix  and  thus  there  are  0 < a < 1 and  (3  > 1 such 


that 


H*(t  + <,t)||</Ja'  Vt,(ez+. 


(6.15) 
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where  3>(fc  + t,  k)  is  the  transition  matrix  of  A(k).  Then  from  (6.12)  we  get 

||*(*  + Oil  < ^\\z(k)\\  + £ fiaiM\\E{k  + Oil-  (6-16) 

t=0 

Assume  that  the  reference  signal  r(k)  is  bounded,  i.e., 

|r(fc)|  < R*  Vk  6 £+  for  some  R*.  (6-17) 

Since  G(0(k),k),  M(k)  defined  as  (5.9)  are  uniformly  bounded  by  Lemma  4.3  iii), 
E(k)  in  (6.14)  gives 

||£(Jfc)||  < (1  + M*)||u(fc)||  + (1  + G*)R*  (6.18) 

where 

||M(fc)||  < M* 

\\G(e(k),k)\\  <G \ 

Now,  it  follows  from  (6.7)  and  (6.18)  that 

||£(Jfe)||  < (1  + M*)(KD  + 2e)N*\\ip(k  — 1)||  + (1  + G*)R* 

+ (1  + M*)(KW  + e)  Vfc  > T.  (6.19) 

From  (6.16)  and  (6.19),  noting  that  (6.11)  gives 

II Hk  - 1)||  < ||z(fc)ll 

we  get  for  any  k > T 

||*(fc  + OII  < Pa'PWW  + '£,ci<xt~'~1\\z(k  + i)\\  + X>2a— 1 

« — n 


(6.20) 
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where 


ci  = P(l+M*)(KD  + 2e)N* 

c2  = 0(1  + M*)(KW  + e)  + 0(1  + G*)R *. 

Using  the  Gronwall  Lemma  which  will  be  proved  at  the  end  of  this  chapter, 

IK*  + OII«-t  < (l+c1/a)t0\\z(k)\\  + c2a-t  + 

4—2 

c2  E(!  + Ci/ct)*-*-' la-'-1  VA:  > T. 

«=o 

Thus,  if  a + cx  < 1,  we  get 

IK*  + Oil  < (a  + eO^lkWII  + r-  /2,  VA:  > T.  (6.21) 

1 - (a  + ci) 

Choose  D*  such  that 

n*  _ ^ ~ a nn\ 

0(1  + M*)KN*  I<  [ ’ 

which  gives 

a + ca<l  V D < D*. 

Note  that  N(k),  P(k),  M(k ) and  G(6(k),k)  are  all  uniformly  bounded.  Therefore, 
N *,  K , Af*  and  G*  exist  which  are  not  dependent  on  D.  It  then  follows  that 
z(k  + t ) is  bounded  for  any  bounded  reference  signal,  for  any  model  uncertainties 
with  D\  < D < D* , for  any  external  disturbances  with  W\  < W and  for  any  initial 
values  of  z(k).  ■ 

The  essential  idea  in  proving  Theorem  6.1  is  to  use  the  equation  (6.12)  and 
the  Gronwall  lemma.  The  equation  (6.12)  was  first  introduced  by  Samson  [8]  and 
thereafter  became  an  important  tool  to  prove  the  closed-loop  stability  for  indirect 
adaptive  control  systems.  See  also  Ossman  and  Kamen  [13]. 
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We  now  prove  the  Gronwall  lemma.  For  convenience  in  notation,  the  summation 
denoted  by  YliLto  or  ^e  product  denoted  by  n|Lt0  will  be  taken  as  zero  if  t\  < t0. 
Lemma  6.1  (Discrete-time  version  of  Gronwall’s  lemma). 

Suppose  that  c(<),  d(t),  r(t)  and  k(t)  are  nonnegative  valued  functions  on  Z+ 
such  that 


t-i 


r(t)  < c(t)  -f  ^(*)r(*)  Vf  E Z~ 
j=o 


(6.23) 


and 


t- 1 


c(<)  = ^d(i)  + c(0). 
»=o 


(6.24) 


Then  for  any  t E Z+ , 


<-i 


t— 2 I t-1 


r(()  < 11(1  + HOMO)  + d(t-  1)  + £ i II  (1  + Hi))  } i(0-  (6.25) 

i=0  I j—i+1  J 


»=0 


Proof.  The  proof  roughly  follows  the  proof  in  the  continuous-time  case  [21,  49]. 


Let 


<- 1 


s(t)  = c(t ) + ^2  k(i)r(i)  for  t > 0 ; 5(0)  = c(0). 


(6.26) 


i=0 


Then 


r(t)  < s{t). 


(6.27) 


Further 


s(i)  — s(i  — 1)  = k(i  — l)r(i  — 1)  -f  d(i  — 1)  for  i = 1, ...,  t. 
It  then  follows  that 

5(f)  — (1  + k(i  - l))5(i  - 1)  < d(i  - 1)  for  i = 1, 


69 


Thus 


s(t)  “ IK1  + ^(0)5(°)  < d(t  - 1)  + J2  I II  + kU))  f d(0- 

i=0  i=0  (j  = i + l J 


Since  s(0)  = c(0)  and  r(t)  < s(f),  we  get  (6.25).  ■ 


(6.28) 


CHAPTER  SEVEN 
DESIGN  EXAMPLE 


In  this  chapter,  we  apply  our  adaptive  control  algorithm  to  a design  of  the 
normal  acceration  controller  for  missiles.  For  this  study,  we  choose  a generic  air- 
to-air  BTT  (Bank-To-Turn)  missile  which  has  been  used  to  develop  an  adaptive 
autopilot  by  Rockwell  [50]  and  in  a number  of  other  studies  (see,  e.g.,  Caughlin  and 
Bullock  [51]).  This  model  has  an  advantage  that  a large  scale  simulation  based  on 
the  6 DOF  (Degree-Of-Freedom)  equations  of  motion  is  available  [52]. 

A typical  trajectory  is  chosen  for  evaluation.  This  benchmark  trajectory  in- 
volves a hard  turn  which  requires  an  immediate  90  degree  bank  and  near  maximum 
accerleration  to  intercept  the  target.  For  the  initial  range  taken,  the  flight  dura- 
tion is  just  over  3 seconds.  The  guidance  law  is  activated  0.4  second  after  launch, 
allowing  the  missile  to  clear  the  aircraft.  The  missile  is  accelerated  for  2.6  seconds 
until  the  fuel  is  exhausted.  The  velocity  changes  from  Mach  0.9  to  Mach  2.6  and 
the  dynamic  pressure  ranges  from  900  psf  to  8000  psf.  Due  to  the  rapid  change  of 
velocity  and  dynamic  pressure,  the  plant  parameters  also  change  rapidly. 

7.1  Plant  Model 

From  the  results  of  standard  derivation  for  missile  dynamics  [53,  54],  we  have 
a linearized  model  for  the  longitudinal  motion  described  as  the  second  order  state 
equation  whose  states  are  the  velocity  in  z-direction  w(k)  and  the  pitch  rate  q(k). 
Assuming  that  the  normal  acceration  Az{k)  and  the  pitch  rate  q(k)  are  directly 
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measurable,  we  have  the  following  discrete-time  plant  model. 

x(k)  = F(k)x{k  - 1)  + G{k)u(k  - 1)  (7.1) 

y(k)  = C(k)x(k)  + D(k)u(k)  (7.2) 

where 

x'(k ) = [w(k)  q(k )] 

V#(*)  = [A,(k)q(k) } 

and  u(k ) is  the  deflection  angle  of  the  control  surface  (the  elevator  or  the  carnard) 
and  F(k),G(k),C(k),D(k)  are  all  time-varying  matrices  which  have  proper  dimen- 
sions. C(k)  and  D(k)  are  in  the  form 


c(k)  0 

; D(k)  = 

d(k) 

C(k)  = 

0 1 

0 

and  F(k)  and  G[k ) are  general  time- varying  matrices.  Figures  7.1  - 7.4  show  entries 
of  F(k)  and  G(k),  c(k)  and  d(k)  for  the  benchmark  trajectory.  This  discrete-time 
model  was  obtained  from  the  continuous-time  model  with  the  following  sampling 
interval  ts  (see,  for  details,  [55]). 

ts  = 0.01  sec. 

Let  y~(k)  denote  the  output  measured  just  before  the  control  input  u(k)  is  ap- 
plied. Since  the  discrete-time  model  is  obtained  from  the  continuous-time  model  by 
assuming  that  the  control  input  is  constant  between  two  adjacent  sampling  intervals, 
y~(k)  can  be  expressed  as 


y~(k)  = C(k)x(k)  + D(k)u{k  - 1). 


(7.4) 
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Figure  7.1:  Entries  of  the  system  matrix  F(k).  F(i,j ) denotes  the  th  element 
of  F(k). 


Figure  7.2:  Entries  of  the  input  matrix  G(k).  G(i)  denotes  the  z-th  element  of  G(k). 


-3 


Figure  7.3:  An  element  c(fc)  of  the  output  matrix  C(k). 


Figure  7.4:  An  element  d(k ) of  the  matrix  D(k). 
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where  C(k)  and  D(k ) are  as  in  (7.2). 

We  can  choose  either  y(k)  or  y~(k ) for  the  parameter  estimation.  However, 
the  choice  will  result  in  different  control  algorithms.  If  we  choose  y(k),  we  will  have 
an  algebraic  loop  in  the  closed-loop  system  due  to  the  direct  term  D(k)u(k).  To 
avoid  this  nuisance,  we  will  choose  y~(k).  Then,  we  have  to  calculate  the  parameter 
estimates  first,  and  the  control  input  next. 

Since  c{k)  ^ 0 (see  Fig.  7.3),  det{C(k)}  ^ 0.  Therefore,  using  (7.1)  we  can 
restate  y~(k)  as 

y~(k)  = A{k)y~(k  - 1)  + B1(k)u(k  - 1)  + B2(k)u(k  - 2)  (7.5) 

where 

A(k)  = C(k)F(k)C~1(k  — 1) 

B^k)  = C(k)G{k)  + D(k ) (7.6) 

B2{k)  = -A(k)D(k- 1). 

The  equation  (7.5)  is  the  plant  model  whose  parameters  are  to  be  estimated. 

It  is  known  a priori  that  the  dynamic  pressure  increases  until  the  fuel  is  ex- 
hausted and  thereafter  it  decreases.  Moreover,  it  is  known  that  the  fuel  is  exhausted 
at 

tc  = 2.6  seconds. 

As  can  be  seen  in  Figures  7.1  - 7.4,  the  magnitudes  of  plant  parameters  increase 
almost  linearly  until  the  fuel  is  exhausted  and  thereafter  they  decrease.  Therefore, 
the  parameters  can  be  approximated  by  linear  curves  until  tc  seconds  and  after  that 
by  a monotonically  decreasing  function  which  goes  to  zero  as  time  goes  to  infinity. 
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From  this  information,  we  will  assume  the  structure  of  parameter  variations  as 


/i(*)  = i;  f2(k)  = { 


t sk 

t2c/(tsk) 


tsk  < tc 
otherwise 


U-7) 


where  ts  denotes  the  sampling  interval  0.01  sec.  Thereby,  we  can  write  A(k),  B,(k ) 
and  B2(k)  as 


A(k)  = J2(Ai  + AAi(k))fi(k) 

i=l 

Bi(k ) = £(£«  + AB,,(k))f,(k)  for  t = 1,2. 

J = 1 


Inserting  (7.8)  and  (7.9)  into  (7.5)  gives 


(7.8) 

(7.9) 


y-(k)  = (e  + S(k)Y<f,(k-  1)  (7.10) 

where  6 is  the  8x2  parameter  matrix,  6(k)  is  the  8x2  uncertainty  matrix  and  <f>(k) 
is  the  8x1  regression  vector,  given  by 

O'  — [A\  A2  fin  fii2  B2\  B22]  (7.11) 

6\k)  = [ AAi(k)  AA2(k)  A fin(fc)  Afii2(fc)  A fi21(fc)  A B22(k)  ] (7.12) 

<t>\k  - 1)  = [fi(k)y-(k  - 1 )'  Mk)y~(k  - 1 )'  A (k)u(k  - 1) 

f2(k)u(k  - 1)  fi(k)u(k  - 2)  f2(k)u(k  - 2)].  (7.13) 


7.2  Estimated  Model 

Associated  with  (7.10),  the  prediction  error  e(k)  is  given  by 


e(k)  = y ( k ) - 0'(k  - 1 )<f>(k  - 1) 


(7,14) 
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Table  7.1:  Possible  parameter  ranges 


Maximum  values 

' 1.05  -0.3  0.1  2.5  10  25  50  25 

0.2  1.05  0.1  0.1  -5  25  2 25  J 

Minimum  values 

' 0.85  -6  -0.1  -2.5  -50  -25  -50  -25 

0.04  0.85  -0.1  -0.1  -50  -25  -2  -25 

where  9(k  — 1)  is  the  estimate  of  6 at  time  ( k — 1). 

Using  the  linearized  model  for  the  benchmark  trajectory  (Fig.  7.1  - 7.4),  the 
nominal  A(k ),  Bi(k)  and  B2(k)  were  calculated.  Based  on  these  curves,  the  possible 
ranges  of  the  tuned  parameters  were  taken  as  in  Table  7.1. 

In  order  to  use  the  state  space  model  (7.1)  and  (7.4),  we  must  find  F(l :),  G(k), 
C(k)  and  D[k ) from  the  estimates  of  A(k ),  B\(k)  and  B2(k)  using  (7.6).  However, 
equation  (7.6)  cannot  be  solved  uniquely  for  F(k ),  G(k),  C(k ) and  D(k).  Therefore, 
we  use  a different  model  which  is  easy  to  construct  from  the  estimated  A(k),B i(k) 
and  B2(k). 

Let 

z(k)  = C{k)x(k).  (7.15) 

Then,  it  follows  from  the  original  state  equations  (7.1), (7. 4)  and  the  definition  of 
y~(k)  (7.5)  that 

z{k)=y~(k)-D(k)u{k-  1)  (7.16) 


z{k  + 1)  = A{k  + l)£(fc)  + B(k  + 1 )u(k) 


(7.17) 
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where 

D(k)  = -A~1(k  + l)B2{k  + 1)  (7.18) 

B{k  + l)  = Bi{k  + l)  - D(k+l).  (7.19) 

Remark  7.1.  According  to  (3.1),  this  example  corresponds  to  the  case  that 
p = 1 and  s = 2.  Thus,  we  have  an  observable  realization  of  (7.10)  in  the  form  of 
(3.20)  and  (3.21).  This  result  is  exactly  same  as  (7.16)-(7.19). 

The  set  composed  of  the  possible  parameter  ranges  (Table  7.1)  contains  some 
points  which  do  not  satisfy  the  reachability  condition  or  the  invertibility  condition  of 
Ap(9 , k ) (Assumption  6 or  7 in  Section  3.1).  If  we  exclude  such  points  from  this  set 
(call  the  resulting  set  ft,*),  ft*  will  not  be  a connected  set  and  thus  not  a hypercube 
as  in  Assumption  6.  Furthermore,  any  hypercube  as  a subset  of  ft*  which  satisfies 
the  reachability  condition  and  the  invertibility  condition  of  Ap(9,  k)  (Assumption  6 
and  7 in  Section  3.1)  is  so  small  in  size  that  we  do  not  need  to  use  adaptive  control. 

An  alternative  to  avoid  these  technical  difficulties  is  to  weaken  Assumption  6 
and  7 in  Section  3.1  as  the  following  which  allow  a hypercube  ft  with  reasonable 
size. 

Al.  There  is  e > 0 such  that  for  any  9 E ft,  an  observable  realization  of 

y(k)  = 9'<f>(k  - 1) 

satisfies 

liminf{|det(Y/v(0,  k))\}  > £ 

k 

where  Y]^(9,k)  is  its  reachability  grammian. 

A2.  If  p < s,  then  there  is  S\  > 0 such  that  for  any  9 E ft, 

liminf{|det(Ap(0,  k))|}  > £i 
k 
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where 

Ar(e,k)  = fl  AviM) 

j- 1 

and  Apj,  j — 1, . . . , r are  submatrices  of  9'  for  any  # £ fh 

Remark  7.2.  Assumption  A1  implies  that  for  a fixed  9 £ fi,  det(F/v(0,  k))  can 
be  zero,  but  it  must  not  occur  infinitely  many  times.  In  other  words,  there  exists  a 
positive  integer  T such  that  for  any  9 £ Q its  observable  realization  ( F(9 , k ),  G(9 , k )) 
is  uniformly  reachable  in  N steps  for  k £ [T,  oo).  When  the  estimated  model  is  not 
reachable  at  time  k < T,  we  use  any  bounded  feedback  gains. 

As  for  the  example  in  this  chapter,  this  alternative  is  not  very  helpful  because 
the  flight  duration  is  just  3 seconds.  However,  we  will  show  that  the  set  of  the 
possible  parameter  ranges  (call  it  fl)  satisfies  assumptions  A1  and  A2  and  use  this 
set  in  the  computer  simulations  to  see  how  the  projection  scheme  works. 

Now  we  check  that  the  set  fl  satisfies  assumptions  Al  and  A2.  For  the  system 
(7.17),  the  2-step  reachability  matrix  R2(9,k)  for  9 £ ff  is  given  by 

R2(e,k)  = [B(9,k  + 1)  A(9,  k + l)B(9,  &)].  (7.20) 

Since  f2{k)  — > 0 as  k — > oo,  R2(9,k ) converges  to  a certain  matrix  R2(9,o o)  where 

R2(9,  oo)  = Aj  x[  AiBn  + B2\  A\(A\B\\  + B2\)  ] (7-21) 

and  Ai,  B n,  B2\  are  submatrices  of  9 as  in  (7.11).  From  the  data  in  Table  7.1,  it  can 
be  easily  shown  that  A\  is  invertible  for  any  9 £ fL  To  show  that  det{i?2($>  °°)}  7^  0? 
let 

ei 
«2 


— + B2\. 
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Then,  from  Table  7.1,  we  get  e2  < 0.  Define 

t - ex/e2. 

Then,  det{f?2($, oo)}  = 0 implies  that 

a2it2  T (a22  — an)t  — a,\2  — 0 (7.22) 


where 


A\  — 


a,\\  a\2 

a2i  a22 

In  order  for  the  equation  (7.22)  to  have  real  solutions,  it  should  satisfy 


(a22  — «n)2  + 4a2iai2  > 0 


(7.23) 


which,  from  the  possible  ranges  of  an  and  a22  in  Table  7.1,  gives 


a21a12  ^ —0.01. 

But,  we  have  from  the  possible  ranges  of  an  and  ai2 


(7.24) 


a21a12  5:  —0.012 


which  contradicts  (7.24).  Therefore,  for  any  0 E D det{i?2(0,  oo)}  ^ 0 which  implies 
that  satisfies  Assumption  6 in  Section  3.1.  However,  as  discussed  in  Section  5.5, 
the  augmented  estimated  model  based  on  the  internal  model  principle  may  not 
satisfy  the  reachability  assumption  like  A1  with  this  Q. 


7.3  Controller  Structure 

For  Az(k ) to  track  a commanded  acceleration  r(k)  so  that  the  track  error 


\Az(k)-r(k)\ 
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is  suitably  small,  we  use  the  internal  model  principle.  Suppose  that  r(k)  is  a step 
command,  i.e., 

r(k)  — a for  some  a 6 7Z. 

By  the  standard  procedure  of  augmenting  the  system  model,  we  define 


v(k  + 1)  = v(k)  + ( Az(k ) - r(k)) 


(7.25) 


Since  Az(k)  is  the  first  component  of  y(k),  we  have  from  (7.2)  and  (7.15) 


v(k  -f  1)  = v(k)  + [ 1 0](-z(fc)  + D(k)u(k))  — r(k). 


(7.26) 


Let  z(k)  and  v(k)  denote  the  estimated  z(k)  and  v(k),  respectively.  Then, 
from  (7.17)  and  (7.26),  using  the  latest  estimates,  we  have  the  following  augmented 
estimated  model. 


z(k  + 1) 
v(k  + 1) 


A{6(k),k  + 1) 

1 0 

B(6(k),k+  1) 
D1(6(k),k) 


0 

0 

1 


z{k) 

v(k) 


+ 


u(k)  + 


0 

0 

—r(k) 


where 


Di(6(k),  k)  = [10 }D(6(k),k). 


(7,27) 


For  the  system  (7.27),  Kleinman’s  method  gives  the  following  control  law. 


u{k ) = —L(9(k),  k) 


z{k) 

v(k) 


(7.28) 
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As  mentioned  in  Section  5.5  the  rachability  of  the  estimated  model  does  not 
imply  the  rechability  of  the  augmented  estimated  model,  so  the  control  law  (7.28) 
may  result  in  the  instability  of  the  closed-loop.  To  avoid  this,  we  take  the  second 
approach  suggested  in  Section  5.5.  If  the  augmented  model  converges  to  a system 
which  is  not  reachable,  the  control  law  (7.28)  will  switch  to  that  of  the  estimated 
model  without  augmentation. 

In  this  example,  the  number  of  outputs  is  the  same  as  the  number  of  states, 
because  p = 1.  Thus,  no  dynamic  observer  is  required.  The  non-dynamic  state 
observer  is  obtained  from  (7.16)  as 

z(k)  = y-(fc)  - D(6(k),k)u(k  - 1).  (7.29) 

On  the  other  hand,  since  inserting  (7.16)  into  (7.26)  gives 
v(k)  = v(k  — 1)  — r(k  — 1)  + 

[ 1 0]{y~(k  - 1)  + D(k  - 1 ){u(k  - 1)  - u(k  - 2))}.  (7.30) 

v(k)  is  then  given  by 

v(k ) = v(k  — 1)  — r(k  — 1)  + 

[1  0 }{y-(k  - 1)  + D(9{k),  k - 1 ){u{k  - 1)  - u(k  - 2))}.  (7.31) 

7.4  Simulation  Results 


To  see  the  performance  of  the  proposed  adaptive  control  algorithm  implemented 
in  the  normal  acceleration  controller,  computer  simulations  were  generated  with 
different  design  parameters  and  initial  values,  by  using  the  linearized  aerodynamic 
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model  or  the  large  scale  simulation  program.  Among  these  results,  we  demonstrate 
the  following  four  cases  obtained  from  the  large  scale  simulations.  For  all  4 cases, 
the  update  intervals  of  control  and  estimation  are  chosen  to  be  0.01  sec. 

Case  1. 


This  case  shows  how  the  proposed  algorithm  works  when  the  estimation  scheme 
takes  the  structure  of  the  parameter  variations  as  described  in  (7.7).  We  do  not  use 
any  dead  zone  in  this  case.  Also,  no  projection  scheme  is  used.  To  ensure  that 
the  missile  clears  the  aircraft  from  which  it  was  discharged  the  guidance  law  is 
not  activated  for  0.4  seconds  after  launch,  however  in  this  interval  the  pitch  rate 
feedback  loop  is  active  regulating  the  pitch  rate  to  zero.  For  this  purpose,  the 
following  constant  feedback  gain  L0,  with  which  the  closed- loop  of  the  nominal 
plant  (Fig.  7.1  - 7.4)  is  stable,  will  be  used: 


L0  = [0  - 0.13  0 ]. 


(7.32) 


The  initial  value  of  the  parameter  esimate  9(k)  was  chosen  within  the  set 
and  the  initial  value  of  the  covariance  matrix  P(k)  was  taken  as  a diagonal  matrix 
whose  diagonal  elements  approximate  the  squares  of  possible  parameter  ranges. 


0(0) 


0.95 

-0.5 

0 

-2  -20 

10  25  10 

0.05 

0.95 

-0.05 

0 -12 

-10  1 -1 

(7.33) 
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P(  0) 


0.1  0 0 0 0 0 0 0 
0 10  000  0 0 0 
0 0 0.1  0 0 0 0 0 
000  10  0 0 0 0 
0 0 0 0 1000  0 0 0 

0 0 0 0 0 1000  0 0 

0 0 0 0 0 0 1000  0 

0 0 0 0 0 0 0 1000 


(7.34) 


A guide  to  choose  the  value  of  a design  parameter  N( 0)  which  is  used  to  cal- 
culate a forgetting  factor  A (k)  is  to  follow  the  constant  information  principle,  i.e., 


N(0)  = 0'(O)P-1(O)0(O). 


with  the  guess  of  0(0).  If  we  choose  a larger  N( 0),  the  resulting  A (k)  will  be  closer 
to  1.  In  this  example,  N(0)  was  chosen  as 


Ar(0)  = 1. 


Since  no  dead  zone  is  used, 

D = 0. 

Figures  7.5  - 7.8  show  a set  of  results  obtained  from  this  simulation.  In  Fig.  7.7  - 
7.8,  the  nominal  system  parameters  (which  were  obtained  from  Fig.  7.1  - 7.4)  are 
marked  “Reference.” 


Case  2. 

This  case  shows  how  the  proposed  algorithm  works  when  the  estimation  scheme 
takes  the  parameters  to  be  estimated  as  piecewise  constant,  i.e., 
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As  in  Case  1,  neither  projection  nor  dead  zone  is  used.  The  initial  value  0(0)  was 
chosen  so  that  the  elements  of  A(k),  Bi(k ) and  B2(k)  have  the  same  values  as  in 


Case  1 at  0.4  second  and  P(0)  was  chosen  similar  to  Case  1.  The  value  of  fV(0)  was 
selected  so  that  the  tracking  errors  of  the  normal  acceleration  became  as  small  as 


possible. 


0(0)  = 


P(0)  = 


0.95 

-1.3 

— 

16  29 

0.03 

0.95 

— 

16  0.6 

0.1 

0 

0 

0 

0 

10 

0 

0 

0 

0 1000 

0 

0 

0 

0 

1000 

N(  0)  = 0.0001. 


(7.35) 


(7.36) 


(7.37) 


The  results  are  shown  in  Fig.  7.9  - 7.12.  As  expected,  the  simulation  results  showed 
poor  performance.  Further  disscussion  will  be  given  in  Section  7.5. 


Case  3. 


This  case  agrees  with  Case  1 except  for  0(0)  and  N(0).  The  initial  value  0(0) 
which  is  more  inaccurate  than  in  Case  1 with  respect  to  the  reference  was  taken. 
The  value  of  N(0)  was  selected  so  that  the  tracking  errors  of  the  normal  acceleration 
became  as  small  as  possible  as  in  Case  2. 


0(0)  = 


0.8  -0.5  0 0 10  0 10  0 

0 1 0 0 -10  0 0 0 


(7.38) 


N{  0)  = 0.01. 


(7.39) 


A set  of  responses  for  this  simulation  is  shown  in  Fig.  7.13 


7.16. 
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Case  4. 

This  case  agrees  with  Case  2 except  for  the  value  of  N( 0)  which  is  much  larger 
than  in  Case  2. 

N{  0)  = 1.  (7.40) 

See  Fig.  7.17  - 7.20.  The  resulting  forgetting  factors  were  much  closer  to  1 than 
Case  2.  Consequently,  the  parameter  estimates  converged  to  some  constants  when 
the  system  was  stable,  which  caused  bad  oscillations  at  1.3  second  and  2.6  second. 

7.5  Discussion 

Since  we  use  a variable  forgetting  factor,  the  estimation  algorithm  is  allowed,  to 
some  extent,  to  track  the  time- varying  parameters.  However,  when  we  compare  the 
results  of  Case  1 or  Case  3 (where  parameters  are  assumed  as  linear  curves)  with 
those  of  Case  2 or  Case  4 (where  parameters  are  asumed  as  piecewise  constants), 
we  see  that  the  performance  of  the  latter  is  much  worse  than  the  former.  This  is 
mainly  due  to  the  following  facts. 

First,  when  the  value  of  N(0)  used  was  relatively  large  (Case  4),  while  the  sys- 
tem was  stable  the  forgetting  factors  were  close  to  1 and  consequently  the  elements 
of  F(9(k),k)  and  G(9(k),k)  converged  to  some  constants  (See  Fig.  7.19).  How- 
ever, in  Case  1 or  Case  3,  these  elements  vary  due  to  /2 ( ) , even  if  the  estimated 
parameters  are  constant.  It  is  obvious  that  the  assumed  structure  of  parameter  vari- 
ations in  Case  1 or  Case  3 has  the  ability  of  more  accurate  time-varying  elements 
representation  so  that  the  resulting  control  law  could  give  better  performance  if  the 
parameters  are  properly  identified.  On  the  other  hand,  when  the  value  of  N(0) 
used  was  relatively  small  (Case  2),  the  ability  of  tracking  time- varying  parameters 
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was  improved,  but  the  resulting  estimated  parameters  changed  rapidly  which  caused 
oscillatory  motions  (see  Fig.  9). 

Secondly,  when  we  calculate  the  feedback  gains  using  Kleinman’s  method,  fu- 
ture values  of  the  system  and  input  matrices  are  required.  This  implies  that  the 
time- varying  feedback  gains  depend  on  the  structure  of  parameter  variations  (de- 
scribed as  known  functions  fi(k),i  = l,...,r).  In  Case  2 or  Case  4,  the  assumed 
parameter  structure 

/i(*)  = 1 

takes  the  parameters  as  constant.  Therefore,  the  resulting  control  law  becomes  a 
controller  based  on  the  frozen  coefficients  which  may  fail  to  work  for  the  plants 
which  are  not  slowly  time- varying  (see  Section  2.7). 

In  the  theory  developed  in  Chapter  Three  to  Chapter  Six,  the  set  f l had  a 
number  of  assumed  properties  and  projection  to  f 1 was  used  whenever  the  parameter 
estimates  were  outside  of  f l.  In  fact,  in  the  simulations  no  significant  difference  of 
performance  with  and  without  projection  could  be  seen.  See  Fig.  7.21  - 7.23.  The 
assumed  structure  of  parameter  variations,  0(0)  and  P(0)  used  in  these  simulations 
are  same  as  in  Case  3 and  no  dead  zone  was  used.  However,  N(0)  was  taken  to  be 
1 which  is  much  larger  than  in  Case  3.  Without  projection,  Fig.  7.21  shows  the 
Frobenius  norm  of  the  distance  from  0(k)  to  H which  is  denoted  by  ||0(jfc)  — tt\\F 
(see,  for  the  definition  of  the  Frobenius  norm,  Section  5.3  or  [47])  and  Figures  7.22  - 
7.23  show  the  response  and  the  feedback  gains  with  and  without  projection.  In  this 
example,  the  tracking  of  the  commanded  acceleration  is  good  in  both  cases  except 
for  a switching  transient  at  1.7  second.  We  may  conclude,  for  this  example,  that 
the  projection  scheme  is  unnecessary  complication  in  practice. 
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For  each  of  4 cases  the  norm  of  feedback  gains  was  properly  bounded  (see  Fig. 
7.8,  7.12,  7.16  and  7.20).  This  implies  that  for  each  case  the  augmented  estimated 
model  did  not  violate  the  reachability  condition  (Assumption  6. a in  Section  5.5). 

D is  a design  parameter  related  with  the  norm  of  the  plant  uncertainties.  To 
prove  the  robust  stability  of  the  closed  loop,  we  assumed  that  D > D\  where  D\ 
is  the  supremum  of  the  plant  uncertainties.  However,  in  practical  applications  the 
value  of  D should  be  as  small  as  possible  because,  in  general,  the  robustness  can 
be  achieved  at  the  price  of  performance.  Using  a dead  zone  shows  slightly  better 
performance  than  otherwise,  when  a design  parameter  D is  properly  selected.  See 
Fig.  7.24  - 7.26  where  the  results  marked  “Without  Dead  Zone”  represent  the  case 
“Without  Projection”  in  Fig.  7.21  - 7.23  while  the  results  marked  “With  Dead 
Zone”  represent  the  same  case  except  that  D = 0.01.  As  shown  in  Fig.  7.24  -7.26, 
the  difference  is  negligible.  In  spite  of  various  plant  uncertainties,  the  large  scale 
simuations  without  deadzone  gave  good  performance.  This  fact  implies  that  our 
adaptive  control  algorithm  without  dead  zone  is,  to  a good  extent,  robust  to  the 
plant  uncertainties. 
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Figure  7.5:  Response  of  the  normal  acceleration  .V,  for  Case  1. 


Figure  7.6:  X(k)  for  Case  1. 
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Figure  7.7:  Some  elements  of  estimated  A(k),  B\(k ) and  B2(k)  for  Case  1.  A(i,j) 
denotes  the  (i,  j)-th  element  of  A(k)  and  Bi(j)  denotes  the  j-th  element  of  B^k). 
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Figure  7.8:  Feedback  gain  L(k)  for  Case  1.  L(j)  denotes  the  j-th  element  of  L(k). 
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Figure  7.9:  Response  of  the  normal  acceleration  Nz  for  Case  2. 


Figure  7.10:  A (k)  for  Case  2. 
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Figure  7.11:  Some  elements  of  estimated  A(k),  Bi(k)  and  B2(k)  for  Case  2.  A{i,  j) 
denotes  the  (z,j)-th  element  of  A(k)  and  Bi(j ) denotes  the  j-th  element  of  Bi(k). 
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Figure  7.12:  Feedback  gain  L(k)  for  Case  2.  L(j ) denotes  the  j-th  element  of  L(k). 
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Figure  7.13:  Response  of  the  normal  acceleration  Nz  for  Case  3. 


Figure  7.14:  \(k)  for  Case  3. 
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Figure  7.15:  Some  elements  of  estimated  A(k),  Bi(k)  and  B2(k)  for  Case  3.  A(i,  j) 
denotes  the  (i,j)-th  element  of  A(k)  and  Bi(j)  denotes  the  j-th  element  of  Bi(k). 
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Figure  7.16:  Feedback  gain  L(k)  for  Case  3.  L(j)  denotes  the  j- th  element  of  L(k). 


Figure  7.17:  Response  of  the  normal  acceleration  Nz  for  Case  4. 


l.l 


l - 

0.9  - 

0.8  - 

0.7 

0.6  - 

0.5  - 

0.4- 

0.3  - 

0.2  - 

0.1  - 
0 


0.5 


1.5  2 2.5 

time  (sec) 


3.5 


Figure  7.18:  X(k)  for  Case  4. 


98 


time  (sec) 


70 

60- 

50- 

40- 

30- 

20 


Reference 

Estimated 


V, 


Estimated  B2(^l) 


0.5 


1.5  2 

time  (sec) 


2.5 


3.5 


-10 
-15  ■ 
-20- 
-25  ■ 
-30- 
-35  L 


Estimated  Bl(2) 


Reference 

Estimated 


0.5 


.5  2 

time  (sec) 


2.5 


3.5 


Figure  7.19:  Some  elements  of  estimated  A(k),  Bi(k ) and  B2(k)  for  Case  4.  A(i,j ) 
denotes  the  (i, i)-th  element  of  A(k)  and  Bi(j)  denotes  the  j-th  element  of  Bt(k). 
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Figure  7.20:  Feedback  gain  L(k)  for  Case  4.  L(j)  denotes  the  j-th  element  of  L(k). 


100 


Figure  7.21:  ||0(fc)  — Q||jr  for  the  case  without  projection. 


Figure  7.22:  Responses  of  the  normal  acceleration  Nz  with  and  without  projection. 
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Figure  7.23:  Feedback  gains  L(k ) with  and  without  projection.  L(j)  denotes  the 
j-th  element  of  L(k). 
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Figure  7.24:  Responses  of  the  normal  acceleration  Nz  with  dead  zone  ( D — 0.01) 
and  without  dead  zone. 
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Figure  7.25:  a(k ) with  D — 0.01. 
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Figure  7.26:  Feedback  gains  L(k)  with  dead  zone  ( D = 0.01)  and  without  dead 
zone.  L(j)  denotes  the  j-th  element  of  L(k). 


CHAPTER  EIGHT 
CONCLUSIONS 


8.1  Summary 

In  this  dissertation,  an  indirect  adaptive  control  algorithm  for  linear  systems 
was  presented.  This  algorithm  can  be  applied  to  linear  time-invariant  systems  as  well 
as  to  linear  time- varying  systems  with  rapidly  varying  parameters.  A key  feature  of 
the  proposed  algorithm  is  the  use  of  an  assumed  structure  of  parameter  variations 
which  enables  it  to  cover  a wide  class  of  linear  systems. 

The  proposed  approach  consists  of  two  parts,  an  estimation  algorithm  and  a 
control  algorithm.  The  estimation  algorithm  is  based  on  the  recursive  least  squares 
algorithm  but  includes  a forgetting  factor,  a dead  zone  and  a projection  scheme 
which  keeps  the  estimates  in  a compact  set  fL  It  was  shown  that  the  robustness 
of  the  closed-loop  system  can  be  achieved  by  using  a dead  zone  in  the  estimation 
algorithm.  The  robustness  is  defined  in  a sense  that  if  the  actual  plant  is  almost 
like  the  tuned  model  (i.e.,  the  plant  uncertainties  are  small  enough)  then  the  closed- 
loop  system  will  be  stable.  A forgetting  factor  based  on  the  constant  information 
principle  is  introduced  in  the  estimation  algorithm.  The  constant  information  prin- 
ciple provides  a forgetting  factor  with  properties  which  are  very  useful  to  prove  the 
closed-loop  stability.  A key  property  is  that  the  infinite  product  of  forgetting  fac- 
tors is  uniformly  bounded  below  by  some  positive  constant  (see  Lemma  4.2).  It  then 
follows  from  this  property  that  the  covariance  matrix  P(k)  and  the  estimates  9(k) 


104 


105 


are  uniformly  bounded  and  converge.  As  a control  algorithm,  Kleinman’s  method  is 
used.  Its  advantage  over  other  methods  is  ease  of  implementation,  especially  when 
the  reachability  index  N is  greater  than  the  system  order  (which  can  occur  when 
the  system  is  time- varying). 

To  prove  the  closed-loop  stability,  a bounded  sequence  of  feedback  gains  which 
makes  the  estimated  model  exponentially  stable  and  an  estimation  algorithm  in 
which  || 0(k)  — 0(k—  1)||  converges  to  zero  and  Lemma  4.3  ii)  is  satisfied  were  needed. 
Therefore,  Kleinman’s  method  can  be  replaced  by  any  other  controller  and  the 
proposed  estimation  algorithm  by  any  other  algorithm  if  they  satisfy  the  conditions 
stated  above.  The  validity  of  a controller  based  on  the  internal  model  principle  (an 
example  is  shown  in  Section  5)  can  be  claimed  from  such  a point  of  view. 

We  applied  this  adaptive  control  algorithm  to  the  design  of  the  normal  acceler- 
ation controller  for  a typical  missile  model.  With  computer  simulations,  it  has  been 
found  that  the  proposed  algorithm  gives  excellent  performance.  We  arrive  at  the 
following  conclusions: 

• The  use  of  the  structure  of  parameter  variations  is  a key  to  solve  the  adaptive 
control  problems  for  linear  time- varying  systems  with  rapidly  varying  param- 
eters. 

• Although  the  theory  developed  required  projection,  in  practice  in  the  simula- 
tions projection  was  neither  beneficial  nor  required. 

• Even  though  the  theory  developed  required  assumptions  on  the  bounds  to 
plant  uncertainties  and  use  of  dead  zone,  the  large  scale  simulations  without 
any  dead  zone  gave  good  performance,  which  implies  that  the  use  of  dead  zone 
may  not  be  required  in  practice. 
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In  the  proposed  algorithm,  many  techniques  are  combined.  We  use  a forgetting 
factor,  a dead  zone,  signal  normalization  and  a projection  scheme  for  the  estimation 
algorithm.  We  also  use  an  adaptive  observer,  Kleinman’s  control  law  and  the  inter- 
nal model  principle  (if  necessary)  for  the  control  algorithm.  For  this  combination, 
we  showed  the  uniform  boundedness  of  the  covariance  matrix  P(k)  and  the  estimate 
9(k).  We  further  showed  the  convergence  properties  of  P(k)  and  || 9(k)  — 9(k  — 1)||, 
and  the  robust  stability  for  linear  time-varying  systems  which  are  not  necessar- 
ily slowly  time- varying.  Therefore,  the  proposed  algorithm  not  only  allows  design 
flexibility  but  also  covers  a wide  class  of  linear  systems. 

8.2  Problems  and  Future  Research  Work 

Research  work  tends  to  uncover  new  problems  as  the  work  progresses.  The 
work  in  this  dissertation  follows  this  rule.  In  this  section  a few  problems  for  future 
research  encountered  in  this  investigation  are  mentioned.  In  this  dissertation,  in- 
direct adaptive  control  of  time-varying  systems  which  are  not  slowly  time-varying 
has  been  considered.  If  it  is  possible  to  get  some  information  about  the  structure 
of  the  time- variation  for  the  control  parameters  instead  of  system  parameters,  then 
we  will  be  able  to  apply  the  idea  of  structured  parameter  variations  to  the  direct 
adaptive  control  scheme  for  linear  time- varying  systems. 

The  work  presented  in  this  dissertation  could  be  strengthened  by  weakening 
some  of  the  restrictive  assumptions.  Assumption  6 in  Section  3.1  required  the  ex- 
istence and  knowledge  of  a hypercube  f)  as  a subset  of  II,  the  set  of  all  parameter 
vectors  satisfying  the  reachability  condition  for  the  estimated  models,  which  is  in 
practice  difficult  to  determine.  Since  ignoring  the  required  projection  lead  to  no 
difficulties  in  the  simulations  it  might  be  possible  to  show  that  an  improved  param- 
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eter  vector  inside  of  II  could  be  found  when  a parameter  estimate  was  outside  of  II 
without  requiring  total  knowledge  of  the  set  II.  Another  restrictive  assumption  was 
Assumption  6. a in  Section  5.5  which  requires  a hypercube  as  a subset  of  T where 
T is  the  set  of  all  parameter  vectors  satisfying  the  reachability  condition  for  the 
augmented  estimated  model.  Again,  it  might  be  possible  to  find  a better  substitute 
in  T when  a parameter  estimate  was  outside  of  T without  requiring  total  knowl- 
edge of  the  set  T.  We  leave  it  as  future  research  work  to  weaken  these  restrictive 
assumptions. 

It  is  well  known  that  if  the  plant  is  properly  excited  by  external  signals  so  that 
the  plant  input  is  persistently  exciting,  complete  parameter  identification  is  possi- 
ble. To  excite  the  plant  externally,  dither  or  probing  signals  are  used  in  practice. 
These  signals  were  not  considered  in  this  dissertation.  Such  external  signals  are 
sometimes  undesirable  from  a control  point  of  view.  Moreover,  even  for  the  linear 
time-invariant  plants  guaranteeing  persistent  excitation  in  the  presence  of  unmod- 
elled plant  uncertainties  is  still  an  open  problem. 
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